Gene Cmaq_0080 details


Gene Information

Locus tag: Cmaq_0080
Symbol:
ID: 5710017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 93529
End bp: 95676
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table: 11
GC content: 43%
IMG OID: 641274583
Product: glycoside hydrolase family 42 protein
Protein accession: YP_001539924
Protein GI: 159040672
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
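
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 93529
END_BP = 95676
GENE_LENGTH_BP = 2148
PROTEIN_LENGTH_AA = 715


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```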


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0820956
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTTC CGGTTTCCGT ATGGTATGGT GTAGGTAGCG TTACCCCACT GTTTGATGAG 
GAAACCATTA AGAGGGATTT AAAGAACATT AAGGAGGCTG GGTTTAAGTA TGTTAGGGGT
TGGGTTAACT GGAGGGATTC TGAACCAAGA CCCGGGGAGT ATGATTTCAG TGGAGTCGAG
AATCTGCTTA AGACCGCTAA CGACATTGGC TTAAGGGTTA TTCTTCAGGT TTACCTTGAA
TTCGCCCCCG ATTGGTTACC TAAGCTTCAC CCAGACTCAC TATACGTCTC TGAGTCAGGC
AGCGTTATCA TGCCTCAAGG TAGCCCTGGG GTTTGCCTTG ACCACCCTGG TGTTAGGGCT
AGGGCTGAGG AATTCATGAG GAAGCTGGCT CAAGTAGTGA TTAAGTACCC TAACTTCTAT
GTCTGGGACT TATGGAGTGA ACCCAATGTT ATTCAGTGGA TTTACCAACC CACTGGTTGG
CGTGGATTAT TCTGCTACTG CAATTACTCT AAGGGTAGGT TCAGGGATTG GTTAAAGACT
ATATACGGCA ACGTTAACAC GTTGAATAAG GCCTGGCATA GGAGTTACCT GGAGTTTAAT
GATGTTGAAC CACCGCGTTT CGTCTCCCTT CACTTCGCTA GGGATAACAT TGATTGGTTA
ACATTCAATA TAGTTAAGCT TAAGGAGGAC CTTGAATGGA GGGTTAAGGT AATTAGGAGT
ATTGATAATA ATCACCCAGT GGTTAGTCAT AGTCATGGCG GTACCTCAGT GTTCAGTAAT
CCACTCTTCG GTGAACCTGA TGATTGGGAA ATGGCCAGTG TGGTTGATGC ATGGGGTACA
TCATTTTACC CCAAGCACGC AGGTAGGGTT AAGGTTGACC ATGTTCTTGA CTCACTAGTA
CTTGATGCAG CTAGGTCAGC GGCATTAGCC AGTGGTAAAC CATACTGGAT AGGGGAGCTT
CAAGCTGGTC AAGGTGTTGG TGGGCTTAAG GCGGTTGAAC CAGTGACACC TGATGATGTG
GCTCTTTGGA TGTGGCAGGC GATTGCACAT GAGGCTAAGG CAATTAACAT ATATCACTGG
TACCCTATGA TGCTTGGTTT TGAGTCAGGT GGGTATGGGT TAATTAACCC TGATGGTTCA
TTAACCGATA GGGCTAGGAA GGCTGGGGAA ACAGCCAGGG TGATTTACGA GAATAGTGAC
CTATTCCTAA AGGCTAAGTT AATTGACTCA AGTGTGGCTA TACTTTATAA TATTGAGTCG
TATAAGTGGC TTTGGATTGC TCAAAGGCAT AGTAGTGATG TATTATCAAG GTCAATACTG
GGTGTTTATA GGGTTCTCTT CAATAGTAAT TATAATGTGG ATTTAGTATC CATTAGACAG
GTTGAAGGAA ACTTAATTAG TAAGTACAAG GTGCTTATAG CCCCATTATC ACTGGTAATG
ACCCTTAAGG CTGCCCTAGG TTTAAGGAAC TTTGTAAGTA ATGGTGGATT ACTACTGGTT
GATTCAAGGT TCGCAGCGAT TAGGAGTGAC GGTTACATTG ACTCAGGTAC ACCGGCGTAT
GGGTTAAGTG AGGTTATTGG GGGTTATGAG GACGGCTACA TGAGTGTGGA TAAGGTGAAC
TTAAGAATCA CTAGTAATCT AATACCTGGT TTAAAGACCG GTGACTTAAT AATTGGTTCT
AATTACGTTA GCTGGCTTAA TTCAAAGGCT AATGAAGTAG GGGTTAGTGA ATTTAACTTA
AGTAAACCTT CAATAACCAT TAATGATTAT GGTAAAGGGA AGGCAATCTA CGTGGGTACA
AGCATTGGTT TATCCTATGA GGCTAATGGA CCAGGGAGTG GTGTTGGAAA ATTAATAGAA
GGCATCATGA ATATTGCCAT GGTTCAACCA CCTGTTGAAG TTAAGTCAAC CAGGGAGGGT
TACATTGAGG TAAGAATAAT GAGGAGTGGT GCTGATTATT TACTCTTCAT AATAAATCAC
TCCTACGGTG ATCAACTAGT TAATGTGAGA ATTAATGAGA ATGTAATTAA TGTGGGCAAT
GCATCAATTA AGGACCTTGT AACTGGTGCA TCAATAAGTA TTACTAATAA TGAGCTTCAG
TTAACCCTAA GGGGTAGGCA AGTTGTAGTA GGGCTTGTGT CTATTTAA
 
Protein sequence
MSFPVSVWYG VGSVTPLFDE ETIKRDLKNI KEAGFKYVRG WVNWRDSEPR PGEYDFSGVE 
NLLKTANDIG LRVILQVYLE FAPDWLPKLH PDSLYVSESG SVIMPQGSPG VCLDHPGVRA
RAEEFMRKLA QVVIKYPNFY VWDLWSEPNV IQWIYQPTGW RGLFCYCNYS KGRFRDWLKT
IYGNVNTLNK AWHRSYLEFN DVEPPRFVSL HFARDNIDWL TFNIVKLKED LEWRVKVIRS
IDNNHPVVSH SHGGTSVFSN PLFGEPDDWE MASVVDAWGT SFYPKHAGRV KVDHVLDSLV
LDAARSAALA SGKPYWIGEL QAGQGVGGLK AVEPVTPDDV ALWMWQAIAH EAKAINIYHW
YPMMLGFESG GYGLINPDGS LTDRARKAGE TARVIYENSD LFLKAKLIDS SVAILYNIES
YKWLWIAQRH SSDVLSRSIL GVYRVLFNSN YNVDLVSIRQ VEGNLISKYK VLIAPLSLVM
TLKAALGLRN FVSNGGLLLV DSRFAAIRSD GYIDSGTPAY GLSEVIGGYE DGYMSVDKVN
LRITSNLIPG LKTGDLIIGS NYVSWLNSKA NEVGVSEFNL SKPSITINDY GKGKAIYVGT
SIGLSYEANG PGSGVGKLIE GIMNIAMVQP PVEVKSTREG YIEVRIMRSG ADYLLFIINH
SYGDQLVNVR INENVINVGN ASIKDLVTGA SISITNNELQ LTLRGRQVVV GLVSI
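
The sequences in this record are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that formatting and compute the GC percentage of a nucleotide string. The function names are hypothetical, and the example uses only the first printed line of the gene sequence, so its result is not expected to match the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence (six blocks of ten bases).
first_line = "ATGTCGTTTC CGGTTTCCGT ATGGTATGGT GTAGGTAGCG TTACCCCACT GTTTGATGAG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` would allow the same calculation over all 2148 bases.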