Gene Cmaq_0950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0950 
Symbol 
ID5708834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp998225 
End bp999379 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content44% 
IMG OID641275451 
Productcellulase 
Protein accessionYP_001540772 
Protein GI159041520 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.374031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAATT ATAAAGCTAA GTTAAGTGTA GTCATTATGA GTAATGAGGA ATTCATAAAC 
CTACTCAAGG AACTCTCCGA GGCGTTCGGT CCCTCTGGCT TTGAGGATGA GGTTAGGGAA
TTAGTGATTA AGGAAATGGA ACCTTACGTG GATGAATTGG AGGTGGATAA ATGGGGTAAT
GTTATTGGTG TTAGGTACGG TAGTAGGAGG GATCTTAAGG TAATGATTGC AGCCCATATG
GATGAGATAG GTTTACTTAT TGATAGTATT GATAAGAATG GTTTCCTAAG GTTTAGGGGT
ATTGGGGGAT GGAATGAGGT AACTTTAGTT GGTCAAAGAG TAATCATTAA GACTCAGGAT
GGTGGAAAAA TAAAAGGTGT AGTGGGTAGT AGGCCTCCGC ATGTGACGCC TCCAGGAAAG
GAGAGGGAGG CTCCGGAGAT GAAGGAATTA TTCATTGATA TTGGGGCAAG TGATTCAAGT
GAGGTTGAGA AACTTGGGGT GAGGGTTGGT TCAGTGGCGG TATTGGATAG GTCCTTTGAG
GTCCTTAATA ATGATACTGT AACCGGTAAG GCATTTGATG ACAGGGTTGG GTTAGCTGTA
ATGCTGTGGA TGCTTAGGCA ATTAAAGAAC CATGAGGTAA CAGTGTACAC TGTAGCCACT
GTTCAAGAGG AGGTTGGGTT AAAGGGTGCG CAGGTTGCCG CTGACAGGGT TTACCCAGAC
TTCGCAATAG CCTTAGATAC AACCATAGCT GCTGACGTAC CTGGTGTATC TGAGCGTGAA
TACGTGTCTA GGCTTGGTGC AGGTCCAGCA TTAAAAATAA TGGATGGTGG AAGGGGCGGC
TTATTCATAG CCCACCCAGG CTTAACTAAC TACATTATTA ATATAGCTAA GGCTAATAAT
GTGCCGTATC AATTAGAGGT ATTGATTGGA GGCACCACTG ATGCTGCTGG TATAGCCTTA
AGGAGGGATG GAATACCTGC AGCAACAATC TCAATACCCA CCAGGTACGT TCACTCACCT
GTGGAGGTGC TTAAGGTAAG TGATGCAGTT AATGCATCAA GACTACTCAC GCTAGTGGTT
CAAGGGGCTA ATGAAGGGTT AATAAGCAGT CTTAGGAGTA GGGTGATTAA GGGTGTTGGG
TTTAAGGTGA CTTGA
 
Protein sequence
MNNYKAKLSV VIMSNEEFIN LLKELSEAFG PSGFEDEVRE LVIKEMEPYV DELEVDKWGN 
VIGVRYGSRR DLKVMIAAHM DEIGLLIDSI DKNGFLRFRG IGGWNEVTLV GQRVIIKTQD
GGKIKGVVGS RPPHVTPPGK EREAPEMKEL FIDIGASDSS EVEKLGVRVG SVAVLDRSFE
VLNNDTVTGK AFDDRVGLAV MLWMLRQLKN HEVTVYTVAT VQEEVGLKGA QVAADRVYPD
FAIALDTTIA ADVPGVSERE YVSRLGAGPA LKIMDGGRGG LFIAHPGLTN YIINIAKANN
VPYQLEVLIG GTTDAAGIAL RRDGIPAATI SIPTRYVHSP VEVLKVSDAV NASRLLTLVV
QGANEGLISS LRSRVIKGVG FKVT