Gene GM21_1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1737 
Symbol 
ID8137068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2022353 
End bp2023594 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content64% 
IMG OID644869349 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003021549 
Protein GI253700360 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones122 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAAG CCACCCCACC TCTCACGGCC AAAATCCACT ACGGCTGGAT CATCGTCGCC 
ACCAGCGCCC TGGGCCTTTT TTCCTGCTTC GGCCTGGCCC GTTACGCATA CTCCATGCTC
ATCCCCGGGA TGCAGGCGGG CCTTGCCCTA AGCTACGACC GTATGGGGTT CATCGGCACG
GCCAATTTCG TCGGCTACCT GGCCTCCGTC CTGGCGGCGC CAAAGCTGAT GGGGCGGCTG
CCCCCCCGGT GGATGGCGGC TTTAGCCCTT TTCGTTATCG GCCTCGGCAT GATCGGTATC
GGCTTTTGCA CCTCTTTTTT CCCGATAATT GCCCTGTACG CGTTGGTGGG GATGGGAAGC
GGATTCACCA ACATCCCCCT CATGGCGCTG GTCACCTTCT GGTTCCGCAG CGAGCATCGC
GGCAAGGCAG CGGGCCTAGC CATCGCAGGG AACGGGATAG GAATCATCCT CGCCGGGTTC
CTGGTTCCCG CACTCAACCG CAGCTTCGGG CCAGACGGCT GGCGCGCCGG CTGGATGGTG
CTGGGGTCGA TATGCCTCGG CATCGCCCTC GTCACCGCTG TTTTGCTGCG AAACCACCCC
TCAGAGCTGG GGCTTGAGCC GGTGGGAAGG GTGGTGGATG CGTCCCCGGA GCAGTTCATC
CACCGCGAGC ACAAGGGGGA CGGCGCGCTC TTGCTCAGGC TGGGGCTTCT CTACCTGGTT
TTCGGCGCCA CCTTCATGGT GTACGGCACC TTCATCGTCA CCACCATGGT GCGGGAGTAC
GGGCTAGCCG AGGCGCGGGC CGGGCTCTAC TGGTCCTGGG TAGGCTTCTT CAGCTTTTTC
TCCGGCATCG GCTTCGGCAC CCTGTCCGAC CGCATCGGCA GGCGCCGTGG GCTTGCCTTG
GTCTTCACCG TTCAGACCGC GGCCTACCTG CTTGCGGGTC TTAAAGCCGG TCTCTTGGGC
CTCACCGTAT CGCTCGTGCT CTACGGCTGC GCCGTCTTTG CCATCCCCGC CATCATGGCG
GCCGCGGTCG GCGATTACCT GGGGCTGAAC CGGGCATCCG CCGCCTTCGG CACCATCACC
ATCTTCTTCG GATTGGGGCA GGTCATCGGC CCCGCCGGAG CCGGGATGAT CGCCAAGTCC
ACCGGCGCCT TCACCACCCC CTACCTCATA GCCGGGATAC TGACCGCCTG CGCCGCGGTC
CTGGCTTTCC TGCTCCCCGA ACCTGCCGGG AAAAGTGCCT GA
 
Protein sequence
MTKATPPLTA KIHYGWIIVA TSALGLFSCF GLARYAYSML IPGMQAGLAL SYDRMGFIGT 
ANFVGYLASV LAAPKLMGRL PPRWMAALAL FVIGLGMIGI GFCTSFFPII ALYALVGMGS
GFTNIPLMAL VTFWFRSEHR GKAAGLAIAG NGIGIILAGF LVPALNRSFG PDGWRAGWMV
LGSICLGIAL VTAVLLRNHP SELGLEPVGR VVDASPEQFI HREHKGDGAL LLRLGLLYLV
FGATFMVYGT FIVTTMVREY GLAEARAGLY WSWVGFFSFF SGIGFGTLSD RIGRRRGLAL
VFTVQTAAYL LAGLKAGLLG LTVSLVLYGC AVFAIPAIMA AAVGDYLGLN RASAAFGTIT
IFFGLGQVIG PAGAGMIAKS TGAFTTPYLI AGILTACAAV LAFLLPEPAG KSA