Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1304 |
Symbol | |
ID | 8136631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1531690 |
End bp | 1533021 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644868918 |
Product | anaerobic C4-dicarboxylate transporter |
Protein accession | YP_003021122 |
Protein GI | 253699933 |
COG category | [R] General function prediction only |
COG ID | [COG2704] Anaerobic C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000000000000133068 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGCGA TGTTTTGGAT CCAGTTCGTA TTGGTGCTGG GAGCGGTGTT GTTAGGTATT CGCAGAGGCG GAGTGGCCCT GGGGCTCATC GGCGGGCTTG GCGTGTCGGT GCTGGTGCTC GGGTTCCGCA GTGTCCCTTC CGAGCCCCCG ATCGCCGTGA TGCTTATCAT CCTCGCGGTG GTCACGGCGT CGGCCACCTT GCAGGTGGCG GGCGGTCTGG ACTACCTGGT GCAACTGACC GAGAAGTTGC TGCGCGCCCA TCCCAAGTAC GTTACCGTCC TGGCCCCTTT GTCCACCTTC TTCCTTACCG TATGTTGCGG CACCGGCCAT GCCGTGTATG CCCTGCTTCC GGTCATTTCG GACGTTGCCC TGAAGACGAA GATCCGCCCC GAGCGCCCGA TGGCGATCTC GAGCGTCGCC TCGCAGATGG GTATCACCGC CAGCCCGGTT GCGGCAGCAG TCACCTTCTT CCTCGGCTTC GCCGCCAAGG CAGGGTATCC CGTCACCCTC ATCGACATCA TCTCCGTCAC CATGCCTGCC GGCATCATCG GCGTGCTGGC AGCCGCCGCC TGGAGTTTCA ACCGCGGCAA GGACCTCGAC AAGGACCCGG AGTTCCAGGC CCGTCTCGAG GATCCCGAAT TCCGCCAGGC GCTCGACGGT GAAGTGACCA CCCTCGGCAT GAAGATCTCC ACCACCGCCA AGGTTTCCGT GGCCCTGTTC TTTGCCGGCG TGGGCAGCAT CATCCTTATC GCCAGCTGTC CCTGGATTCT CCCTTTGACC GCTGCAGGCA AACCGATCGC CATGACCACC GTGGTCCAGT TCGTCATGCT TGCCTTTGGC GCCTTCATCA TGTTCAGCGC CAACGTCAAG GCCAAGGAAA TAGCCCACTC CAGCGTCTTC ACCGCCGGCA TGATCGCCGT CGTGTCCATC TTCGGCATCG CCTGGATGAG CGATACCTTC ATCACCGCCA ACAAGAAGTT CCTGGTCGAC AACATCGGCG TGATGGTGAA GATGGCACCT TGGACCTTCG CCATCGCCAC CTTCTGCATC TCCGCCTTCG TCAAAAGCCA GGCAGCGACC TTGGCCATCA CGCTTCCCCT GGGGCTCGCC CTCGGGCTCC CCGTCCCGCT ACTGCTCGGC CTGATGCCGG CAAGCTACGC CTACTTCTTC TTCGCCTTCT ACCCCAGCGA CCTGGCCGCC ATCAACATGG ACCGCACCGG CACGACCAGG ATCGGGAAGT ACCTCCTGAA CCACAGCTTC ATGATTCCCG GGCTGATCGG TGTCAGTGTC TCGACCGTGG TCGCCTACGC CATCTCCCAG TTTCTTCTCT AA
|
Protein sequence | MAAMFWIQFV LVLGAVLLGI RRGGVALGLI GGLGVSVLVL GFRSVPSEPP IAVMLIILAV VTASATLQVA GGLDYLVQLT EKLLRAHPKY VTVLAPLSTF FLTVCCGTGH AVYALLPVIS DVALKTKIRP ERPMAISSVA SQMGITASPV AAAVTFFLGF AAKAGYPVTL IDIISVTMPA GIIGVLAAAA WSFNRGKDLD KDPEFQARLE DPEFRQALDG EVTTLGMKIS TTAKVSVALF FAGVGSIILI ASCPWILPLT AAGKPIAMTT VVQFVMLAFG AFIMFSANVK AKEIAHSSVF TAGMIAVVSI FGIAWMSDTF ITANKKFLVD NIGVMVKMAP WTFAIATFCI SAFVKSQAAT LAITLPLGLA LGLPVPLLLG LMPASYAYFF FAFYPSDLAA INMDRTGTTR IGKYLLNHSF MIPGLIGVSV STVVAYAISQ FLL
|
| |