Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3965 |
Symbol | |
ID | 8139339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4548382 |
End bp | 4549542 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871581 |
Product | iron-containing alcohol dehydrogenase |
Protein accession | YP_003023739 |
Protein GI | 253702550 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0000000175274 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGAAG GTCTCGAACT GAGAAAATTT CTGGCCCCCG AGTTCATTTT CGGCGCCGGA GCGCGGCAGT TGGCCGGCAG GTACGCCAAG AACCTGGGGG GGCGCAAGAT CCTGGTGGTA TCGGACCCCG GGGTAGTCGA GGCGGGGTGG ACCAAGGACG TCACCGACAG CCTGGAGGCC GCCGGTCTTT CCTATGTCCT CTTCACCGCC ATCACCCCCA ACCCCAAGGT CGAGGAAGTC ATGGCCGGGG TGGCGCTCTA CCAGGCGGAG CGCTGCGACC TCTTGGTCGC GGTCGGGGGA GGTAGCCCCA TCGACTGCGC CAAGGGGATC GGCATAGTCA GCACCAACAA GAAGCACATC CTCGATTTCG AAGGGGTGGA CATGGTTACC TCCCCCATGC CGCCCTTGGT CTGCATCCCC ACCACCGGTG GAACCTCCGC CGACGTCTCC CAGTTCGCCA TCATCAGCAA CCCCATGGAG AGGGTCAAGA TCGCCATCAT CAGCAAGTCG GTCGTCCCGG ACATCGCCCT CATCGACCCC GTCACCCTCA CCACCATGGA TCCCTACCTG ACCGCCTGCA CCGGGCTCGA CGCCATGACC CACGCCATCG AAGCCTTCGT CTCCACCGCC CGCTCCGGCA TGACCGATCT GCACGCCCTG GAGGCGCTGC GCCTGCTCTC GGCGAGCCTC GTCCCCAGCA TCCGCAACCC CGAGGACCTG AACCTGCGCG GCGACGTCAT GATGGGGAGC CTGCAGGCCG GGCTCGCCTT CTCCAACGCC ATCCTCGGGG CCACCCACGC CATGGCGCAC AGCCTCGGGG GCGCGCTCGA CCTGGCACAC GGGGAGTGCA ACGCCATCCT GCTGGACCAC GTCATAGAGT TCAACTTCGC CGCGTCGCCG GAGCGGTTCG AGCGGATAGC GCAGGTCATG GGGCTCGACC TGCGGGGCCT CCCGACCCAG GAGAAGCAAA AGGCACTCTT GCGGCATGTC AGGGAGCTGA AGGCGCAAGC GGGGGTGGCC CGGACCCTGG CCGAGGTAGG GGTGGGGCTG AGCGACCTCT CCCTTTTCAG CGAGCATGCC CTCAAAGACC CCTGCATGGC GACCAACCCG CGCCGCCCCT CCAAGAGGGA CATCGAGGTC GTATATGAAG AAAGCCTCTG A
|
Protein sequence | MAEGLELRKF LAPEFIFGAG ARQLAGRYAK NLGGRKILVV SDPGVVEAGW TKDVTDSLEA AGLSYVLFTA ITPNPKVEEV MAGVALYQAE RCDLLVAVGG GSPIDCAKGI GIVSTNKKHI LDFEGVDMVT SPMPPLVCIP TTGGTSADVS QFAIISNPME RVKIAIISKS VVPDIALIDP VTLTTMDPYL TACTGLDAMT HAIEAFVSTA RSGMTDLHAL EALRLLSASL VPSIRNPEDL NLRGDVMMGS LQAGLAFSNA ILGATHAMAH SLGGALDLAH GECNAILLDH VIEFNFAASP ERFERIAQVM GLDLRGLPTQ EKQKALLRHV RELKAQAGVA RTLAEVGVGL SDLSLFSEHA LKDPCMATNP RRPSKRDIEV VYEESL
|
| |