Gene GM21_3965 details

Gene Information

Locus tag: GM21_3965
Symbol:
ID: 8139339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4548382
End bp: 4549542
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 66%
IMG OID: 644871581
Product: iron-containing alcohol dehydrogenase
Protein accession: YP_003023739
Protein GI: 253702550
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
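
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4548382
end_bp = 4549542
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```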


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0000000175274
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGGAAG GTCTCGAACT GAGAAAATTT CTGGCCCCCG AGTTCATTTT CGGCGCCGGA 
GCGCGGCAGT TGGCCGGCAG GTACGCCAAG AACCTGGGGG GGCGCAAGAT CCTGGTGGTA
TCGGACCCCG GGGTAGTCGA GGCGGGGTGG ACCAAGGACG TCACCGACAG CCTGGAGGCC
GCCGGTCTTT CCTATGTCCT CTTCACCGCC ATCACCCCCA ACCCCAAGGT CGAGGAAGTC
ATGGCCGGGG TGGCGCTCTA CCAGGCGGAG CGCTGCGACC TCTTGGTCGC GGTCGGGGGA
GGTAGCCCCA TCGACTGCGC CAAGGGGATC GGCATAGTCA GCACCAACAA GAAGCACATC
CTCGATTTCG AAGGGGTGGA CATGGTTACC TCCCCCATGC CGCCCTTGGT CTGCATCCCC
ACCACCGGTG GAACCTCCGC CGACGTCTCC CAGTTCGCCA TCATCAGCAA CCCCATGGAG
AGGGTCAAGA TCGCCATCAT CAGCAAGTCG GTCGTCCCGG ACATCGCCCT CATCGACCCC
GTCACCCTCA CCACCATGGA TCCCTACCTG ACCGCCTGCA CCGGGCTCGA CGCCATGACC
CACGCCATCG AAGCCTTCGT CTCCACCGCC CGCTCCGGCA TGACCGATCT GCACGCCCTG
GAGGCGCTGC GCCTGCTCTC GGCGAGCCTC GTCCCCAGCA TCCGCAACCC CGAGGACCTG
AACCTGCGCG GCGACGTCAT GATGGGGAGC CTGCAGGCCG GGCTCGCCTT CTCCAACGCC
ATCCTCGGGG CCACCCACGC CATGGCGCAC AGCCTCGGGG GCGCGCTCGA CCTGGCACAC
GGGGAGTGCA ACGCCATCCT GCTGGACCAC GTCATAGAGT TCAACTTCGC CGCGTCGCCG
GAGCGGTTCG AGCGGATAGC GCAGGTCATG GGGCTCGACC TGCGGGGCCT CCCGACCCAG
GAGAAGCAAA AGGCACTCTT GCGGCATGTC AGGGAGCTGA AGGCGCAAGC GGGGGTGGCC
CGGACCCTGG CCGAGGTAGG GGTGGGGCTG AGCGACCTCT CCCTTTTCAG CGAGCATGCC
CTCAAAGACC CCTGCATGGC GACCAACCCG CGCCGCCCCT CCAAGAGGGA CATCGAGGTC
GTATATGAAG AAAGCCTCTG A
 
Protein sequence
MAEGLELRKF LAPEFIFGAG ARQLAGRYAK NLGGRKILVV SDPGVVEAGW TKDVTDSLEA 
AGLSYVLFTA ITPNPKVEEV MAGVALYQAE RCDLLVAVGG GSPIDCAKGI GIVSTNKKHI
LDFEGVDMVT SPMPPLVCIP TTGGTSADVS QFAIISNPME RVKIAIISKS VVPDIALIDP
VTLTTMDPYL TACTGLDAMT HAIEAFVSTA RSGMTDLHAL EALRLLSASL VPSIRNPEDL
NLRGDVMMGS LQAGLAFSNA ILGATHAMAH SLGGALDLAH GECNAILLDH VIEFNFAASP
ERFERIAQVM GLDLRGLPTQ EKQKALLRHV RELKAQAGVA RTLAEVGVGL SDLSLFSEHA
LKDPCMATNP RRPSKRDIEV VYEESL
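
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the database's own method: the `translate` helper and the codon table construction are hypothetical illustrations, and the table uses the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code only in which codons may act as start codons). The input strings are the first three 10-base blocks and the final line of the gene sequence above.

```python
from itertools import product

# Standard amino acid assignments, ordered with bases T, C, A, G
# (third codon position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the gaps between 10-base blocks) is ignored.
    Codons with characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGCGGAAG GTCTCGAACT GAGAAAATTT"
last_line = "GTATATGAAG AAAGCCTCTG A"

print(translate(first_blocks))  # MAEGLELRKF
print(translate(last_line))     # VYEESL
```

The two outputs match the first ten and last six residues of the protein sequence listed above. The last line can be translated on its own because the preceding 19 lines hold 60 bases each, so it begins on a codon boundary.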