Gene GM21_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0079 
Symbol 
ID8135378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp104013 
End bp105290 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content64% 
IMG OID644867696 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003019924 
Protein GI253698735 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.28688e-24 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGAGT GCCGGCCGGG GGTAATGACA ATGCATGAAC AGGAGCGGCT GCGCCGGCAG 
CGCTGGTTTA TTTTCTTTAT CCTGGCCCTG ATCTACATCA TGGTGTACTT CTATCGCGTC
TCGCTCGCGG TCGTCGCCAA GGACGTCTCG CGCGACCTGA ACCTCACCCC CTCTCAACTG
GGTTCTCTTT CCAGCATCCT CTTCTACGTC TACGCCGCGG CACAGATCCC GCTGGGGCCT
ATGATCGACC GACTCGGAAG CCGGGTGGTC ATCAGCGGCT CCGGGGTGCT CACCGCGCTG
GGCGGCATCC TCTTCTCCCA GGCGGCCAAC ATGGGGCAGG CCGTGGCCGG CCGCGTTCTC
TTAGGGATCG GAACCGCATC GGTGTTGATG GCTACCTTCA CCATCTTCAG CCACTGGTTC
ACCAAGCAGG AATTCGGAAA GGTCTCCGGA ATGATGGTCG CGACGGGAAA CCTGGGGAAC
CTGGCGGGGA CGGCGCCTTT GGCCTTGGCG GTCGCAGCGG TCGGGTGGCG CAACTCCTTT
CTCGCGGCGG GGGTGCTGCA GGCGGTGGTC ACGGTACTGG TTTTCGGCCT GGTGCGAGAC
CGCCCCCCCG TTCCCGATCG GCATGAGGAG GAAGCGCCGG CCCGGCTGGG GATGCTTGAA
GCATGGAGGA AGATCGTCTC CAACGGGGAC TTCTGGCTCT TGGCCGCGGT GGCGTTCGCC
TGGTACGGGA ACTACCTGGC GGTGCAGGGG CTTTGGGGGG GGCCCTACCT GATGGAAGTG
GTGAGGCTTA CCCGGGAGGA GACGGGAAGG ATGCTGATGT ACACCTCGCT GGGGTTCATC
GCGGGGAGCC TCATGATCGA CCACGTGGCG CGCAGGATCC TCCGTTCCTA CAAGAAGACC
CTTCTCGGCG GGCAACTGCT GCTGTTGCTC CTCATGACGA GCTTTCTCGG GCTCGCCGAC
AAGATGCCGA CGGCAGCGCT CTCGGCGCTG TTCTTCGGCC TGGGGCTCTG CGTCTCCAGC
GGCGTGATGA TCTATCCCAT CATCCGCTCC ATGTTCCCGG TAGCCATAGT GGGGACCGCG
CTCACGTCGC TCAACTTCTT CGTGCTGCTG GGGGCCGCAT CGGTGCAGCA GGGGATGGGG
ATAATGATCG GCGCGGTCGC GAAGACGACA CCCGAGGCGA CGGCGCAGGC GTATCATTCG
GCGTTCCAGC TCCCCATCGG GGCGCTGGCG TTCGCCGCGG CCATGTTCTT CTTCGCCAAG
GATTATTGGG AGAAGTAG
 
Protein sequence
MLECRPGVMT MHEQERLRRQ RWFIFFILAL IYIMVYFYRV SLAVVAKDVS RDLNLTPSQL 
GSLSSILFYV YAAAQIPLGP MIDRLGSRVV ISGSGVLTAL GGILFSQAAN MGQAVAGRVL
LGIGTASVLM ATFTIFSHWF TKQEFGKVSG MMVATGNLGN LAGTAPLALA VAAVGWRNSF
LAAGVLQAVV TVLVFGLVRD RPPVPDRHEE EAPARLGMLE AWRKIVSNGD FWLLAAVAFA
WYGNYLAVQG LWGGPYLMEV VRLTREETGR MLMYTSLGFI AGSLMIDHVA RRILRSYKKT
LLGGQLLLLL LMTSFLGLAD KMPTAALSAL FFGLGLCVSS GVMIYPIIRS MFPVAIVGTA
LTSLNFFVLL GAASVQQGMG IMIGAVAKTT PEATAQAYHS AFQLPIGALA FAAAMFFFAK
DYWEK