Gene Cmaq_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0206 
Symbol 
ID5709701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp243755 
End bp244807 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content45% 
IMG OID641274709 
Productcellulase 
Protein accessionYP_001540045 
Protein GI159040793 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCG CAACATTATC TAAACTAACC CTTGAGATTG GCCCATCAGG TTTCGAAGAT 
AGGGTGATAA GAACAATAAT AAGCATGATT AGAAATCGTG TTGATGAAGT TAATGTAGAT
AACATGGGTA ACCTAATAGC GAGGATTGGT AATGGTCCTT TTAAACTAAT GATAAGTGCC
CATGCTGATG AAGTAGGTGT TATGGTTTCA CACATTGATC AAAGAGGCTT CATTAAGGTT
GTTCCAATAG GTGGGATTGA TCCATGGGTT ATGATTGAGC AGGAGTTAGT TTTCATGGGA
CGTAACGGTG ACATATATGG CACTGTTGGT GTTGATCCAC CGCACTTAAG GAGGGATAAG
CCTCCATCCA GGTTTGAGGA GCTTTACGTT GATGCCGGCT TCACCTCTAA TGATGAAGCC
TTTAAGGCAG GTATATTACC TGGTGTGGCA GGGACCTTTG CGGCGTCATT TAGGGAGAGG
GGCAGTGTAG TAATAGGTAA GGCGTTAGAT AATAGAGTCG GCTGCAGTGT ACTTGTGGAT
TTAGCTGAGG AGGCTGGGGG AATGGTTACC GGTGACTTAT CCCTTTACCT GGTTTGGAAT
ACGCAGGAGG AGGTTGGGTT AAGGGGTATA AATGCGGCTG TTAACGCCAT TAACCCAAAC
ATGGCCATTG TCGTTGAAAC AACCGTTGCC GCAGATGTTC CAACTAATCC CGAGAATGAA
TGGATAACTA GGATAGGTAA TGGTGCTGCA ATTAGGGCTT TAGATAGATC CATGATAACT
AACCCACGGT TACTATCAGC CGTATTGGAG TTAGCATCAT CAAGGGGAAT TAAGTACCAG
GTTCAAGTCA ACCCATATGG TGGCACTGAC GCCGGTGCTA TACATGTCCA CGGTACAGGT
GTACCAACAG TAGTTGTATC TACACCAGCC AGGTATATCC ACACACCCCA CTCAGTGGTT
AATCTCAGTG ATGTTGAGCA GGTTAAGTCA ATGATCACCC TAATAGTGAG GGAACATGCT
GAATTAAGTA GGGTAATGAG GATTCAGGCT TAA
 
Protein sequence
MDAATLSKLT LEIGPSGFED RVIRTIISMI RNRVDEVNVD NMGNLIARIG NGPFKLMISA 
HADEVGVMVS HIDQRGFIKV VPIGGIDPWV MIEQELVFMG RNGDIYGTVG VDPPHLRRDK
PPSRFEELYV DAGFTSNDEA FKAGILPGVA GTFAASFRER GSVVIGKALD NRVGCSVLVD
LAEEAGGMVT GDLSLYLVWN TQEEVGLRGI NAAVNAINPN MAIVVETTVA ADVPTNPENE
WITRIGNGAA IRALDRSMIT NPRLLSAVLE LASSRGIKYQ VQVNPYGGTD AGAIHVHGTG
VPTVVVSTPA RYIHTPHSVV NLSDVEQVKS MITLIVREHA ELSRVMRIQA