Gene Cmaq_1660 details

Gene Information

Locus tag: Cmaq_1660
Symbol:
ID: 5709271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1734486
End bp: 1736840
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 43%
IMG OID: 641276168
Product: glycoside hydrolase family protein
Protein accession: YP_001541473
Protein GI: 159042221
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
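
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1734486, 1736840)
print(length, protein_length(length))  # 2355 784
```

Both values agree with the Gene Length and Protein Length fields of this record.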


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.624656
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.00115127
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAATAC TTAAAAATAT TGAAGTATTG TGTACAATTA TGGATCCCTG TAGAACCATT 
AGTTTCTTCG CCATTGAGAA TGAATCATTA TTGAAGACTG CAGAAGTTAT TTTAACTACA
AAGGTCTCAA TAAGTGGGTT AACGTATAAC TTAACAATAA GACGACTACC ATTAAGTGCA
ATCACCCTTA GTATTGAGTT AACCGAATCC AGTGGGGTTG CTGAATCTAG AAGCATTAAC
ACTGGTGTTG GTGAATACTC AATAATCATT GACACTAATC CATTATCCTT AACAGTGTTG
AAGAGTGGTA GAGTAGTGTT TAGTAATAAG GTTGCACTAG GCCCTGGTTA CTTTCATAGG
GTTAACAATG AATTACATGT ACCATTCTCA ATAAGCCCAG GTGAAGGTAT TTACGGTATG
GGTGAGTGGT TTGGTAGACT TAATAAGGTT GGGCAGAGGC TTGAGGTATA TGTTGTTGAT
CCAGGGGGTT TACCTAATGA TAAAACTTAT GTGGCCTACC CCTTCTTCTG GTCCACTGAA
GGCTATGGGT TACTCATAGA CACTTACTGC CGTGTTATGC TTGACTTCGG TTCAAGATAC
CTTGGAGTGG GGGAATTAAT AATACCCAGT AACGTTAACA TGTATTTGAT CCTATCCAAT
GAACCGAGTA AAATCATTAA GGCATTCTGG GAACTCACAG GGCATCCTGA AGTACCACCC
CTATGGTCCT TCGGCGTCTG GTACAGTTTA TGGAGGGGTT CAGGTTACAT TTACTCCGAT
TATAAGACGC AGGATGATGT TGTTAAATTC GCTGAGGATG TTAGGAGGCG TGGTTTACCC
GGTGATGTTA TTCACATTGA TCCAATATAC ACCATTAGGC CCCTTAGAGC ATCCTTGAAG
AGGTTTCTCA AGAACCTTGG ACTTACTGAT GATGAGATTA AGGAACTTGA GGAGTACCAT
AGGGTTAACG GGAGGTGGAA TTATAGACCA CTCCTGGAAT ACATTAAATC CAAGTGGCCG
GATAAGTATA CTGAGTTAAT TAACAAGTAC CCATTCACAG TCTCAGACTG CACCTTCGAG
TGGCATGATG GTTTCCCAAA CCCTAGGCTA ATGTTCAGTA GGCTTCATGA GCTTGGATTC
AGGGTAAGCA TATGGGTTAA CCCATATGCA GCAGTGGGTA GTGAATGGTT CAGTGAATTA
AGTGAGCGTA ACCTACTGGT TAAGGTTAAT GGTAGGCCCA TGGTTGAGAC ACCAGCCTTC
ATTAAAGGTG AACTAGTTGA CTACAGGATG TTGACTGATG ACTTCGGGGC AGTGGACTTC
ACTAATGATG ATGCCTGCAG GGCTTACTCA GAGAGGATTA GGCAGCTGCT TGAATTAGGT
GCAGACACTA TTAAGACTGA TTATGGTGAA GGAGCCCCTG AAAATGGTGA ATACTCAGTG
GGCATTAACC CATGTATACA TAACCTCTAC CCAGTGCTTT ACAATAAAGT GGTTTACGAG
ACCATAAGGA GCCTTAAGGG TGAGCCCATT GTATGGGGTA GGTCAGGGGG ATTAGGTATT
CATAAGTACC CAATAAGGTG GACCGGGGAC CCTGACTCCA CACCAAGGGG TATGGCTGCG
TCACTCAGGG GTGTGTTATC AATGGCTACC TCGGGAATAA TGTACTCAAG CGTGGATATT
GGCGGCTACG GTGGTAAACC CACGGTTGAG CTTTACGTTA GGTGGGCTCA AATGGGGCTC
CTATTAAGCC ACAGTAGGTT CCATGGAGTA AGTGAGAGGG AGCCTTGGAG TTATGGTGAA
GAGGCCTACA GTATAGTTAA AGGATTCATT AAGCTTAGGT ACTCCCTAAT ACCGTACATT
TACTCCCAAG TCATTGAGGG TTTAAGGACT GGTAAACCAC TCGTTAGGCC ACTTGTAATG
GATTACCCCA GTGATGAGGT CACCAGGGAT ATTGAGGATG AGTATATGCT TGGCGAGTAC
ATGTTAATAG CCCCAGTGTT CTCAGGGGAT GCTAGGTCGG TTTACCTACC TGAGGGGAAT
TGGTACGATT ACTGGAGTAT GAGCATTATT AGGGGTCCAA CCACTATTAA TGTTCACTCA
CCGTTAAGTA GGGTACCAAT CTACGTTAAG GATGGGGCCT TAATAGCGTA CAGCGCGGTT
GATTCAATGA ACTTAACCAC TGATGTACTC AACAACCTAC TTGTTGAGGT TTACGGTAAT
GCCGGTGAAT TTAACATAGA CTTAGGTAGA TACGGTAAGT TAACTGGTAT ACAGGTTACT
TATGATAAGG TTTACTCAAT GGGTGGGTTT AAGATACGCT TCATTAAGGC GTCACCCCAT
GGTCTAAGCT CTTAA
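
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGGAATAC TTAAAAATAT TGAAGTATTG TGTACAATTA TGGATCCCTG TAGAACCATT"
print(len(clean_sequence(first_line)))       # 60
print(f"{gc_content(first_line):.0%}")       # GC fraction of this line only
```

Passing the complete gene sequence to `gc_content` gives the value to compare against the GC content reported in the gene information.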
 
Protein sequence
MGILKNIEVL CTIMDPCRTI SFFAIENESL LKTAEVILTT KVSISGLTYN LTIRRLPLSA 
ITLSIELTES SGVAESRSIN TGVGEYSIII DTNPLSLTVL KSGRVVFSNK VALGPGYFHR
VNNELHVPFS ISPGEGIYGM GEWFGRLNKV GQRLEVYVVD PGGLPNDKTY VAYPFFWSTE
GYGLLIDTYC RVMLDFGSRY LGVGELIIPS NVNMYLILSN EPSKIIKAFW ELTGHPEVPP
LWSFGVWYSL WRGSGYIYSD YKTQDDVVKF AEDVRRRGLP GDVIHIDPIY TIRPLRASLK
RFLKNLGLTD DEIKELEEYH RVNGRWNYRP LLEYIKSKWP DKYTELINKY PFTVSDCTFE
WHDGFPNPRL MFSRLHELGF RVSIWVNPYA AVGSEWFSEL SERNLLVKVN GRPMVETPAF
IKGELVDYRM LTDDFGAVDF TNDDACRAYS ERIRQLLELG ADTIKTDYGE GAPENGEYSV
GINPCIHNLY PVLYNKVVYE TIRSLKGEPI VWGRSGGLGI HKYPIRWTGD PDSTPRGMAA
SLRGVLSMAT SGIMYSSVDI GGYGGKPTVE LYVRWAQMGL LLSHSRFHGV SEREPWSYGE
EAYSIVKGFI KLRYSLIPYI YSQVIEGLRT GKPLVRPLVM DYPSDEVTRD IEDEYMLGEY
MLIAPVFSGD ARSVYLPEGN WYDYWSMSII RGPTTINVHS PLSRVPIYVK DGALIAYSAV
DSMNLTTDVL NNLLVEVYGN AGEFNIDLGR YGKLTGIQVT YDKVYSMGGF KIRFIKASPH
GLSS
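
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string used for NCBI translation table 11, which assigns the same amino acids as the standard code. It assumes the sequence is given in coding orientation, and it does not handle the alternative start codons that table 11 allows; this gene starts with ATG, so that does not matter here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... GGG order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence from this record.
print(translate("ATGGGAATAC TTAAAAATAT TGAAGTATTG"))  # MGILKNIEVL
print(translate("GGTCTAAGCT CTTAA"))                  # GLSS
```

The two outputs match the first ten and the last four residues of the protein sequence listed above.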