Gene GM21_3142 details


Gene Information

Locus tag: GM21_3142
Symbol:
ID: 8138493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3646338
End bp: 3647642
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 64%
IMG OID: 644870746
Product: major facilitator superfamily MFS_1
Protein accession: YP_003022927
Protein GI: 253701738
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
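
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3646338, 3647642)
print(length)                  # 1305
print(protein_length(length))  # 434
```

Both values match the Gene Length and Protein Length fields listed in the record.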


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 142
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAA CAACATCTAA CAAGGGGTGG CAGGTAGTCA TGGCCGGAAC CGGAATCAAC 
CTGGCGCTCG GGGTGCTGTA TGCCTGGAGT ATCTTCAAGG GGGCCATCAA GTCGTCCATC
GAAGAGGGCG GCCCCGATGC GTTCCAGTGG AGCCTCGGTT CCATCAACGA CCCGTACGCT
CTTTGCTGTC TCGCCTTTGC TTTCTCGATG ATCCTCGCAG GCAAGTGCCA GGACAAGATC
GGCCCCGCGA AGACCGCTCT CATCGGCGGC ATCCTGGTCG GCGCCGGCTT CTGCCTCATG
GGCTACTCCA ACAGCTACGC AGCCTGGGTC ACCGGCTTCG GCGTCCTGGC CGGCTCCGGC
TTCGGTTTCG GCTACTCCGC CGCCACCCCT CCGGCGCTCA AATGGTTCTC GTCCAAGAAG
ACCGGTCTCA TCGCCGGCAT CGTGGTCGCC GGTTTCGGGC TCGCGCCGGT GTACATCGCG
CCGCTCTCCA GCTACCTTTT GGGCGCGTAC GGCATCCAGC AGTCGATGTA CATCCTCGCC
GCCGGTTTCG CCGTCATCGT CTGCGGCCTC TCCTTTGTAC TGGTAAACCC GCCCAAAGGG
TTCGTTCCCG CCGAACCGGT GATCAAGGGC GAAGAAGGGA ACCCCGCCCC CGCCAAGAAG
GCCGTGCACG ACGCGACCGT CGCCGAGATG CTCCGCTCTC CCAAGTTCTA CATGCTGTGG
ACCACCTTCT TCATCGGTGC GGGCGCAGGA CTTATGGTGA TCGGCTCCGT GGCGGGCCTC
GCCAAGAAGA GCATGGGCCC CATGGCATTC GTCGCCGTGG CCATCATGGC GATCGGCAAC
GCCGCCGGCC GCGTCGTTGC CGGCGTTCTT TCCGACAAGA TCGGCCGCCG CGCCACCCTG
ACCATCATGC TCAGCTTCCA GGCGGTGCTG ATGTTCGCTG CCGTCCCCGT CGTCGGCTCC
GGTTCCGCGA CCCTGCTGGT GCTCCTGGCC TCCCTAATCG GCTTCAACTA CGGCTCCAAC
CTGACCCTCT TCCCCTCCTT CGCCAAGGAC TACTGGGGCT TCAAAAACTA CGGCCTCAAC
TACGGCGTAC TGTTCAGCGC CTGGGGCGTA GGCGGCATGG TGATGGGGCG TGTCTCCGAG
ATGATGAACG CGCAGCCCGG CGGTCTGAAC AAGTCCTTCA TCCTGGCGGG TTCCTGCCTT
GCCATGGGCA CCATCGTCAC CTTCTTCCTG CGTGAGAAGA AGGCGGTCGC GGTCGAGGCT
GCCGAGGTGG TCGGAGAGAA GGTCGCGGTC AAGGTTTCCG CCTAG
 
Protein sequence
MSKTTSNKGW QVVMAGTGIN LALGVLYAWS IFKGAIKSSI EEGGPDAFQW SLGSINDPYA 
LCCLAFAFSM ILAGKCQDKI GPAKTALIGG ILVGAGFCLM GYSNSYAAWV TGFGVLAGSG
FGFGYSAATP PALKWFSSKK TGLIAGIVVA GFGLAPVYIA PLSSYLLGAY GIQQSMYILA
AGFAVIVCGL SFVLVNPPKG FVPAEPVIKG EEGNPAPAKK AVHDATVAEM LRSPKFYMLW
TTFFIGAGAG LMVIGSVAGL AKKSMGPMAF VAVAIMAIGN AAGRVVAGVL SDKIGRRATL
TIMLSFQAVL MFAAVPVVGS GSATLLVLLA SLIGFNYGSN LTLFPSFAKD YWGFKNYGLN
YGVLFSAWGV GGMVMGRVSE MMNAQPGGLN KSFILAGSCL AMGTIVTFFL REKKAVAVEA
AEVVGEKVAV KVSA
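
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The following is a minimal Python sketch, not part of the original record. It assumes the space-separated 10-base blocks are only a display convention and can be stripped. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 differs in which codons may act as start codons, and since this gene begins with ATG no special handling is included. The helper names are hypothetical, and only two short fragments of the gene sequence are used as input.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGTCCAAAA CAACATCTAA CAAGGGGTGG"))  # MSKTTSNKGW
print(translate("AAGGTTTCCG CCTAG"))                  # KVSA
```

The two fragments translate to the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.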