Gene GM21_3464 details


Gene Information

Locus tag: GM21_3464
Symbol:
ID: 8138836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4007278
End bp: 4009464
Gene Length: 2187 bp
Protein Length: 728 aa
Translation table: 11
GC content: 65%
IMG OID: 644871084
Product: glycoside hydrolase family 57
Protein accession: YP_003023244
Protein GI: 253702055
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID:
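
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4007278, 4009464)
print(length, protein_length_aa(length))  # prints: 2187 728
```

Under these assumptions, the computed values agree with the listed Gene Length (2187 bp) and Protein Length (728 aa).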


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 105
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGC CACTTTACGT GTCGTTTCTC TGGCACATGC ATCAGCCTTT CTACAAGGAT 
CCGGTCCGGG GGGAGTACGT CCTGCCTTGG GCTTACCTGC ACGCGGTGAA GGACTACTAC
GACATGCCCG CCATCGTCGA GGCGGTCGAA GGGGCCAAGG TGGTATTCAA CCTGGTCCCC
TCGCTCTTGG AGCAGATCCT CGACTACGCC GGGGGGAAGG CGGTGGATCC CTTCCTGCTG
CGGGCCCGCC CCTCGCCTGC CGAATTGGAC GACGGCGACC GGCTCTTCCT CCTCGAAAAC
TTCTTCTCCG CGAACCGCCA GCGCATGATC GAGCCCTACC CCCGCTACAG GGAGCTCTTC
ACCCGGGCCG GGGAGGGTGC CCCGGGTGCC GCGGCGATGC GCCTTGCCTC CTTCAGCGAG
CAGGACCTCC TGGACCTCCA GGTCTGGTTC TATCTCGCCT GGACCGGCGA GGCGGCGAGG
AGGCGCTTTC CCGTATTCGG GGAGCTGATC CGCAAGGGGA GCCACTTCAG CCAGGAAGAC
AAGGAGCTTC TCCTGGAGAC GCAGCGCGAG CTGATCTCCC AGGTGATACC CCTGTACAAG
AAGCTTCATC AAGAGGGGAA GGTCGAACTC TCGGTCACCC CCTACTTCCA CCCGATCATG
CCGCTTCTGT GCGATTCGGG GATCGCGCGG GTCGCCCTCC CCACGGCGAA CCTCCCCAGC
ATCCCCTTTC GCTACCCCGA GGATGCCAGG GCACAGCTGC TGCACGCGAT CGCCAGCTTC
GAGAGGCTCT TCGGCTTTCC CCCGACCGGA ATCTGGCCCT CGGAAGGATC GGTGAGCGAC
GAGGTGTTGG GCATCATGGC GCAGACGGGG CTTCCCTGGA CCGCCTCGGA CGAACGGGTG
CTGGCCCATA CGTTGCCGGG GGGGCTGGAC CGTGAGCGCG ACCCCTTGTA CCACCCGTAC
ACCTTCAGTA AGGACGGGCG CGAGATCGCG CTTTTCTTCC GGGACCAGGG GCTGTCGGAC
CTGATCGGCT TCACCTACTC CCAGTGGGAG ACCGAAAGGG CGGTGGGAGA TTTCCTGGCG
CGGCTGAAGG AGGTGAGGCA GCGCAACCGC CAGGCCCGCG TGGTGCCGGT GATTCTCGAC
GGCGAAAACG CCTGGGAGTA CTACGCGGAG AACGGCTTTC CCTTCCTCTG CCGGCTCTAC
GCCGCCCTGG TGCAGACCCC CGGGGTGCAA CTCGCCACCT TTTCGGAGGT GCTCTTGCGG
ACAGGCGAGC GGCGCGTCCT GGAGCACGTC CACCCGGGAT CGTGGATCAA CGCCGACTAC
GGCATCTGGA TCGGCAAGCC CGAGGAGAAC CTCGGGTGGG ATTACATCGC CAAGGCGCGT
GCCGCGGCGG TGCAGGGAAA CGCGGAGATG GCGACCCTCC TGGCCGGGGG GGAGAGCAGC
GATGAGGCCG CCCGGCAGGC GTGCATGGCG CTCTACGCAG CGCAGGGGAG CGACTGGTTC
TGGTGGTACG GCGACGATCA CTTCTCCCCG CATGCAGGGA GGTTCGACCT GCTGTTTCGC
AGCCACCTGA TGAACGTCTA TCAGCTGCTG GCCCTGGAGG TTCCCGGCGA GCTGCAGCAG
CCGATAAAGA AAGAACGCCC ACCCGGTTTC GTGCGCGCGC CCGCGGGGCT GGTCACCCCC
GCCATAACGG GTGTGGTGAA CGACTACTTT GAGTGGCTCT CGGCAGGGCT ATACGACCTG
ACCCGGCAGG GGGGCGCCAT GCACGCCGCC GACAACCTGC TGCAGTCCTT CTATTACGGC
TTCGACCTGG AGTACCTCTA TTTCCGCATC GACGGAGTGC AGCCGCTGGA AAAGACGTTC
AGGCCGGAGG ACAGGCTCTC GCTGCATCTG CTCTGCGGTG GGGAGTGGCG GCTCGACATG
CAGTTGGGCG AGGGGGAGGG GGAGTTGCAG GTGCTGAGGG AGGGAGCCTG GCATGGCAGC
GGCAGTATCG GACGCTACTG CATGGGGCGC AGCGCTGGAG CGCGCGTGCC CCTGCCTCCC
TTAAGGCTCG AAGGGGGAGG GACGGTACTT TGCTATCTCT GCGTCACCCG CGCCGGGACC
CAGGTGGGAC GCTGGCCGGC CGACGCTGCA CTCCCCTTGG TCTGCGCCGG GCCGGAACTT
GGATGCGAGG CTCATTATAA TCATTAA
 
Protein sequence
MTEPLYVSFL WHMHQPFYKD PVRGEYVLPW AYLHAVKDYY DMPAIVEAVE GAKVVFNLVP 
SLLEQILDYA GGKAVDPFLL RARPSPAELD DGDRLFLLEN FFSANRQRMI EPYPRYRELF
TRAGEGAPGA AAMRLASFSE QDLLDLQVWF YLAWTGEAAR RRFPVFGELI RKGSHFSQED
KELLLETQRE LISQVIPLYK KLHQEGKVEL SVTPYFHPIM PLLCDSGIAR VALPTANLPS
IPFRYPEDAR AQLLHAIASF ERLFGFPPTG IWPSEGSVSD EVLGIMAQTG LPWTASDERV
LAHTLPGGLD RERDPLYHPY TFSKDGREIA LFFRDQGLSD LIGFTYSQWE TERAVGDFLA
RLKEVRQRNR QARVVPVILD GENAWEYYAE NGFPFLCRLY AALVQTPGVQ LATFSEVLLR
TGERRVLEHV HPGSWINADY GIWIGKPEEN LGWDYIAKAR AAAVQGNAEM ATLLAGGESS
DEAARQACMA LYAAQGSDWF WWYGDDHFSP HAGRFDLLFR SHLMNVYQLL ALEVPGELQQ
PIKKERPPGF VRAPAGLVTP AITGVVNDYF EWLSAGLYDL TRQGGAMHAA DNLLQSFYYG
FDLEYLYFRI DGVQPLEKTF RPEDRLSLHL LCGGEWRLDM QLGEGEGELQ VLREGAWHGS
GSIGRYCMGR SAGARVPLPP LRLEGGGTVL CYLCVTRAGT QVGRWPADAA LPLVCAGPEL
GCEAHYNH
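
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own method, of how the record could be processed: it strips the block spacing, computes a GC fraction, and translates the DNA. It assumes the codon assignments of NCBI translation table 11, which match the standard genetic code, and all function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_line = "ATGACGGAGC CACTTTACGT GTCGTTTCTC TGGCACATGC ATCAGCCTTT CTACAAGGAT"
last_line = "GGATGCGAGG CTCATTATAA TCATTAA"
print(translate(first_line))  # MTEPLYVSFLWHMHQPFYKD
print(translate(last_line))   # GCEAHYNH
```

The two translated fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.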