Gene GM21_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1741 
Symbol 
ID8137072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2026667 
End bp2027914 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID644869353 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003021553 
Protein GI253700364 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones129 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCA GTGATGAAAG CACGAAAGTG GCAATGCGGC AGGTATTCGG CCTGCCGGTG 
ATAGTGGCGG CCCTGGGGTA CTTCGTCGAC ATCTACGACC TGGTGCTGTT CAGCATCGTC
AGGGTCCCAA GCCTCAAGGC GATCGGGCTC TCCGGGCAGG AACTGATCGA CAAAGGGGTG
TTCCTGCTCA ATATGCAGAT GGCGGGCATG CTGCTGGGGG GGATCCTCTG GGGGGGCCTC
GGCGACAGAA AGGGTCGCCT CAAGATCATG TTCGGCTCCA TCTTCATCTA CTCGCTGGCG
AATCTCGCCA ACGGCATGGC CAACTCCATC GAGGCCTACG CCTTTCTCCG CTTCATGGCG
GGGGTGGGCT TGGCAGGCGA ACTCGGCGCC GGCATCACGC TGGTGAGCGA GGTACTGCAC
CGCTCCGTCA GGGGGTACGG CACCATGATC GTGGCGACGG TCGGGGTATC GGGTGCCATC
CTCGCCAACA TCGTCGCCAA GGAGTTCGAC TGGCGCACCG CCTTCGTCAT CGGCGGCATC
CTGGGCCTTT TGCTCCTGCT GCTGCGGGTA ACGGTCGCCG AATCCGGGAT GTTCAAGGGG
ATGGAATCAA AGGAGGTCGC CAAGGGGAAC TTCCTCGCCC TCTTCACCTC GCGCGACCGC
TTCGGCCGCT TCATGAATTC CATCCTGATC GGCCTCCCCT CCTGGTTCGT GGTGGGGGTC
CTGATCACCT TCTCCCCCGA ATTCGCCAAG GCCCTCGCGG TCCAGGGAAC GGTCAGCGCC
GGCAACGCGG TCATGTACTG CTACATGGGG CTCGTGGCCG GCGACCTCGT CAGTGGGCTA
TTGAGCCAGT TGCTGAAAAG CCGCAAGAAG GTGGTGCTCC TTTTCCTACT CTTGACCGTC
GCGGCGGTAG CGGGCTACTT CAGCGCCGCC GGGGTTTCCG CCGGCTCCTT CTACCTCATC
TGCGGCTTGC TCGGCTTCGG TATCGGCTAC TGGGCCATCT TCGTGACCGT GGCGGCGGAG
CAGTTCGGAA CCAACCTGAG GGCCACCGTC GCCACCACCG TCCCCAACTT CGTGCGCGGC
ATGACCATCC CCATCACCAT GCTGTTCCAG GCGGCAAGAA AGGTCCTCGG GCTGGAAATG
GGCGCCCTTG CCGTCGGGGC GCTTTGCCTC GTCATCGCGC TGATAAGCCT TTCCCTGCTG
CAGGAGACTT TCCACAAGGA TCTCGATTAT TTCGAGGAGT ACCTCTAA
 
Protein sequence
MTSSDESTKV AMRQVFGLPV IVAALGYFVD IYDLVLFSIV RVPSLKAIGL SGQELIDKGV 
FLLNMQMAGM LLGGILWGGL GDRKGRLKIM FGSIFIYSLA NLANGMANSI EAYAFLRFMA
GVGLAGELGA GITLVSEVLH RSVRGYGTMI VATVGVSGAI LANIVAKEFD WRTAFVIGGI
LGLLLLLLRV TVAESGMFKG MESKEVAKGN FLALFTSRDR FGRFMNSILI GLPSWFVVGV
LITFSPEFAK ALAVQGTVSA GNAVMYCYMG LVAGDLVSGL LSQLLKSRKK VVLLFLLLTV
AAVAGYFSAA GVSAGSFYLI CGLLGFGIGY WAIFVTVAAE QFGTNLRATV ATTVPNFVRG
MTIPITMLFQ AARKVLGLEM GALAVGALCL VIALISLSLL QETFHKDLDY FEEYL