Gene Information |
Locus tag | GM21_1039 |
Symbol | |
ID | 8136361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1222591 |
End bp | 1223910 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868650 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_003020858 |
Protein GI | 253699669 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
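The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both are assumptions, not statements from the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for GM21_1039.
length = gene_length(1222591, 1223910)
print(length)                           # 1320, matches "Gene Length"
print(expected_protein_length(length))  # 439, matches "Protein Length"
```

Under these assumptions the reported gene length (1320 bp) and protein length (439 aa) agree with the coordinates.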
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.681651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA AGAAGTTTTA CCAGCATCTG TACTTCCAGG TATTGACGGC GATCTCGTTC GGCGTAGCGC TCGGTTACTA CCTGCCGGAC ACCGGTACCG CGATGAAGCC CTTGGGTGAC GGGTTCATCA AGATGATCAA GATGATCATC ACCCCCATCA TCTTCTGCAC CGTGGTCACC GGCATCGCCG GCATGGACGA CATGAAAAAG GTCGGCCGCG TCGGCGGCAA GGCGCTCCTC TACTTCGAGG CGGTCTCCAC ACTGGCGCTT GGCATCGGGC TCGTGGTGAT CAACATTATC CAGCCGGGGG TCGGCATGAA CGCCGATGTC ACGAAGCTTG ACACGAAAGG GCTCGACACC TACACCGCTA CCGCGGCCAA GGGGCACAGC TTCGTGGACT TCGTGCTGGG CGTCATCCCC AACAGCGTGG TCGACGCCTT CGCCAAGGGC GAAATCCTTC AGGTGCTCTT CTTCGCCATC CTGTTCGGCC TGGCGCTCTC CGCCATGGGC GAGAAGGGTA AGCCGGTGTA CCGCCTGATC GACGAGGTGG CGCACGCCTT CTTCGGCGTG GTGAACATCA TCATGAAGTT CGCTCCCATC GGCGCCTTCG GCGCCATGGC CTTCACCATC GGCAAGTTCG GGCTGGGCTC TTTGACCAAG CTCGGGATGC TGATGGGAAG CTTCTACCTG ACCTGCCTGC TCTTCGTCTT CGTGGTGCTC GGCACAATCG GCAAGCTGTG CGGCTTCAAC ATCTTCAAGT TCATCTCCTA CATCAAGGAA GAGCTCCTCA TCGTGCTGGG GACCTCCTCT TCCGAATCCG CGCTTCCCCG CATGATGGCG AAGCTGGAGA ACCTCGGCTG CTCCAAGTCC GTGGTCGGAC TGGTGATCCC CACCGGCTAT TCCTTCAACC TGGACGGCAC CTCCATCTAC CTCACCATGG CCGCGGTGTT CGTGGCGCAG GCGACCAACA CCCCGCTGGA CCTGACCCAG ACGCTGACCA TCCTGGGCGT GCTGATGCTC ACCTCGAAAG GGGCCGCCGG CGTCACCGGC AGCGGTTTTG TTACGCTGGC CGCTACCTTT GCCGCCATCC CCACCATCCC GGTCGCGGGG CTCGCCCTCA TCCTCGGTAT CGACCGCTTC ATGTCCGAGG CGCGTGCCCT CACCAACCTG GTTGGTAACG GCGTCGCTAC CGTGGTCGTC TCCCGCTGGG AGAACGAGCT GGACGTGGCC CGCATGTCGC AGGTGCTGAA CAAGGAATTG GATGAGGCTG ACGGGGAGGC TGATCTGCTG ATGATGGACC CCGAGCCCGA AGAGGCTTAG
|
Protein sequence | MKTKKFYQHL YFQVLTAISF GVALGYYLPD TGTAMKPLGD GFIKMIKMII TPIIFCTVVT GIAGMDDMKK VGRVGGKALL YFEAVSTLAL GIGLVVINII QPGVGMNADV TKLDTKGLDT YTATAAKGHS FVDFVLGVIP NSVVDAFAKG EILQVLFFAI LFGLALSAMG EKGKPVYRLI DEVAHAFFGV VNIIMKFAPI GAFGAMAFTI GKFGLGSLTK LGMLMGSFYL TCLLFVFVVL GTIGKLCGFN IFKFISYIKE ELLIVLGTSS SESALPRMMA KLENLGCSKS VVGLVIPTGY SFNLDGTSIY LTMAAVFVAQ ATNTPLDLTQ TLTILGVLML TSKGAAGVTG SGFVTLAATF AAIPTIPVAG LALILGIDRF MSEARALTNL VGNGVATVVV SRWENELDVA RMSQVLNKEL DEADGEADLL MMDPEPEEA
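As a rough illustration of how the gene and protein sequences relate, the following sketch removes the 10-base grouping, translates codons, and computes a GC fraction. It is not the database's own pipeline. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
prefix = clean("ATGAAAACAA AGAAGTTTTA CCAGCATCTG")
print(translate(prefix))  # MKTKKFYQHL, the first ten residues of the protein
```

Running `translate` and `gc_fraction` on the full cleaned gene sequence should make it possible to compare the result against the listed protein sequence and the reported GC content.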