Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2283 |
Symbol | |
ID | 8137623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2662568 |
End bp | 2663755 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869898 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003022090 |
Protein GI | 253700901 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.000000119381 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTTTA AGAAGATCAG CGCATTGCTC TTGGTAGCGA TGCTGGCGGC CGGCGCATTC GGCTGCAAGA AGAAGGAAGA GGCCCCGGGT GCAGAGGCAG CGAAACCGGC CGGTGACACG GTGAAGATCG GTTTCCTCGG CGCTCTCACC GGCGATGTGG CCATGTTCGG CAAGCCGACC CTCGAAGGTA TGAAGATGGC TGCCGCCGAA CTCAACGCAG CCGGCGGCGT TCTCGGCAAG CAGATCGAGA TCGTAGAGGC CGACAACCGC GGCGACAAGC AGGAAGGCGC CTCGGTCACC CAGAAGTTCA TCTCCCGCGA CAACGTGACC GCCATCGTCG GCGATCCGAC TACCGGCATC ACCAAGGTTG CAGCTCCCAT CGCGCAGAAA GCAGGCGTGG TGCTTCTTTC CGCCGGCGCC ACCGGCCCGG GCGTGGTCGA AGTGGGCGAT TTCATCTTCC GCGACACCCT GCTCGACTCC ATCGCGATTC CCGCCTGCAT CGAGTATTTC GCAAAGGATC TCGGCTTCAA GAAAGTCGCC ATCGTAACTT CCGACAACAA CGACTACTCG GTCGGCCTTT CCCAGACCTT CCGCGATGCA GCCGCCAAGG TCCCCTCGAT CACGATCGTA GCGGACGAGA AGGTGAAGGA CGGCGACAAG GACTTCTCCG CTCAGATCAC CAACATCAAG GGGAAAAAGC CGGACGTCAT CCTCTTCTCC GGTTACTACA CCGAAGGTGC TCTCATCATG AAAGAGGCCC GTAAGCAGGG TCTGAAGGCT TCGATGTTCG GCGGCGACGG ACTCTTCTCG CCCAAGTTCA TCGAGCTCGG CGGCCCGGCT GTCGAGGGCT CCATGTCCGC TCTGGGCTTC TCCACCGAGC AGGCTTCCCC TGCGACCGCC AAATTCATCG AGGCGTTCAA GGCGAAGCAT AACGGCGAAC TCCCGGGCTT GTTCGACGCT CAGGGCTACG ACGCAGTGAT GCTCTTGGCC GACGCCATGA AGCGCGCCAA CAGCGTCGAC GCCAAGGTCT TTAAAGATGC CCTTGCCAAG ACCAAAGGCT TCGAAGGCGT TTCCGGCACC ATCAGCATGC AGGCCAACCG CGAGCCGATC AAGAGCCCGC TCTCCCTTCT CGCTGTGAAG GACGGCAAGT TCGTGCTCAA GGCAAAAGTC CCCGTCAAAA TGGACTAA
|
Protein sequence | MSFKKISALL LVAMLAAGAF GCKKKEEAPG AEAAKPAGDT VKIGFLGALT GDVAMFGKPT LEGMKMAAAE LNAAGGVLGK QIEIVEADNR GDKQEGASVT QKFISRDNVT AIVGDPTTGI TKVAAPIAQK AGVVLLSAGA TGPGVVEVGD FIFRDTLLDS IAIPACIEYF AKDLGFKKVA IVTSDNNDYS VGLSQTFRDA AAKVPSITIV ADEKVKDGDK DFSAQITNIK GKKPDVILFS GYYTEGALIM KEARKQGLKA SMFGGDGLFS PKFIELGGPA VEGSMSALGF STEQASPATA KFIEAFKAKH NGELPGLFDA QGYDAVMLLA DAMKRANSVD AKVFKDALAK TKGFEGVSGT ISMQANREPI KSPLSLLAVK DGKFVLKAKV PVKMD
|
| |