Gene Cmaq_1025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1025 
Symbol 
ID5710380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1074329 
End bp1076386 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content45% 
IMG OID641275526 
Productraffinose synthase 
Protein accessionYP_001540846 
Protein GI159041594 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0334443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTAA GTAACTTAAT TACCAGCATA GTAATAAGTT TTGACGATGG TTCAACCTGC 
AATGTTGATT CACCCTCAGC CTCCTTCAAC CTCTGTGGTT TTGGTTCAGG TGAATTAAGG
GTTACTAGTG ATGAATCAGC ATTACTAATA GGCTTAACCC TTAAGGCAAG TAAACAATTA
AGCAAGTACC CAGTATCACT GGTGCTTAAT CAACCTAAGC CAAGTAGGGT GCTTGCCTTA
ACCTTCCTAG GCCTAGCAGG CCCATTCATC GGTAAGGCCT TCGGTTACTA TAACTACGTT
GCCCAGGGTA AGCAGCCTAG GTCCGAGCCA CCGCCTGATA AACCAGAGTA CCCACCGGGG
GTTAAGGCAA GTGGTTTAGT TGAGTCTGAT CCATTGGACT GCTGGTCCTA CCCAATGCTT
GTTAATAATT ATGGTGAACT ACACCCATAC ACCGTAATGG TGCTTATTGA TTCAGGTAAT
GGATCATACA CAGCATTATT CACCTTCTCC AACAATCAAT TAACCGCATG GCTTGATAAG
GGCCTAGTGA TAAGAACCTA CACCAGTAAG CCCAGTGACG AGGTTAAGTT AAGTTACGTA
GCCTCCATAG CCACAGGCAG TGACCCATAC GATGCAGTGG CTAAGGCTGT TTCCTCAGCC
TCCAGGGTTA CTGTGTTTAA GACGAGGAGT CGTAAGGCTA AGCCCCTATT CATGAATGGG
TTAGGGTGGT GTAGTTGGAA TGCATTACTC AGTGATGATT TAAGCCATGA TAATGTGGTT
AAGATAGTTA AGGGGCTTAG GGATAGGGGA GTACCCATTA GCTGGGTTAT TATTGATGAT
GGTTGGCAGG ACCTTTGGAA TGGTGTAATT AATAGCATTG AGCCAAGTAA GGTGAAGTTC
CCAAGGGGCT TTAAAGCCGT GGTGGATGAG TTAAGGAACC TGGGTGTTAG TAATATTGGA
TTATGGTTCA CCATAAACCT ATACTGGAAC GGCGCCTCTG AAGCCTTCAT TAAGGCTCTT
AACGCTGAGG GCTTTAAGAC AAGTAGAGGC TACGTACCTA AGCCTAACCT TGAGGACTCC
TTCAAGCTCT ATGATGCCTG GTTCAGGGTG CTTAAGAGTA ATGGCTTCAG TTTCGTTAAG
GTTGATAACC AGTGGTCAAT ACACCACTTA TACAGGGGGT TTGCAAATGA TGCCGAGGCC
GCTGCGGCCG TTGAATTAGG CCTTCAATTA GCAGCCACCA CTAATGGTTT AGATGTATTA
AACTGCATGA GCATGCTCCC CGGTAACTAC AGTAACTACG CCATTAGTAA TGCCTTAAGG
GTTTCAATAG ACTACATCCC AATGTGGAGG ACTGACGCTA AGCTTCACAC AATGTGGAGT
GCCTACAATA GCCTACTCTA CAGTAACTTC GGTTACCCCG ACTACGACAT GTGGATTAGC
TACGACCCAT CAGCAAGGCT AATAGCCGTC AGCCGCATAT TCAGCGGTGG CCCAGTATAC
ATTACTGACC GTGAACCTGA GAAAACCAAT GTTGAGTTAA TTAAATGGAT TACACTCAGT
AATGGCGAGG TCATTAGGGT TGATGAACCA GCATTACCAA CTAGGGATAT CTTATTCAGA
GACCCCTACA ATGAGACTGT ACTACTTAAA CTAGCCTCAA CAGTCAATGG TTACCCAGCC
ATCGCCTTCA TGAATGTTAA TAAGAATGGT GTAAGGATTA GTGAGGAGTT TAAACTAGTT
AATATGCCCA TGAAGTTAAA TGGCCAATAC GCATACTACA AGGTGATTAG TGGGGATTGG
GGCATCGTTA AGCCTGATGA TTCAATTAAG GTTGAGTTAA GTGAATTAGA GGCTGAGGTA
GTAGTGCTGG CGCCATTAAT CAACGGTAAG GCTGCATTAG GCATAGTGGA GAAGGCGCTG
CCACCATACG CAATTAAGGC AACCCCAATT AACGGTGAAT TAATGGTGGA GGCTAGGGAG
GATGGAACAA TGGCTTACGT GAAGGAGGGG AGGATGGAGA GAATTAGGGT GAAGGCAGGG
GAGAGGGTAA GAATATGA
 
Protein sequence
MSVSNLITSI VISFDDGSTC NVDSPSASFN LCGFGSGELR VTSDESALLI GLTLKASKQL 
SKYPVSLVLN QPKPSRVLAL TFLGLAGPFI GKAFGYYNYV AQGKQPRSEP PPDKPEYPPG
VKASGLVESD PLDCWSYPML VNNYGELHPY TVMVLIDSGN GSYTALFTFS NNQLTAWLDK
GLVIRTYTSK PSDEVKLSYV ASIATGSDPY DAVAKAVSSA SRVTVFKTRS RKAKPLFMNG
LGWCSWNALL SDDLSHDNVV KIVKGLRDRG VPISWVIIDD GWQDLWNGVI NSIEPSKVKF
PRGFKAVVDE LRNLGVSNIG LWFTINLYWN GASEAFIKAL NAEGFKTSRG YVPKPNLEDS
FKLYDAWFRV LKSNGFSFVK VDNQWSIHHL YRGFANDAEA AAAVELGLQL AATTNGLDVL
NCMSMLPGNY SNYAISNALR VSIDYIPMWR TDAKLHTMWS AYNSLLYSNF GYPDYDMWIS
YDPSARLIAV SRIFSGGPVY ITDREPEKTN VELIKWITLS NGEVIRVDEP ALPTRDILFR
DPYNETVLLK LASTVNGYPA IAFMNVNKNG VRISEEFKLV NMPMKLNGQY AYYKVISGDW
GIVKPDDSIK VELSELEAEV VVLAPLINGK AALGIVEKAL PPYAIKATPI NGELMVEARE
DGTMAYVKEG RMERIRVKAG ERVRI