Gene Gmet_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_3221 
Symbol 
ID3740976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp3621151 
End bp3622401 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content66% 
IMG OID637780508 
Productmajor facilitator transporter 
Protein accessionYP_386159 
Protein GI78224412 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00772282 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATATCA ACGCTCCGTC GCCCCCCTCG AAAAGCTCTC CCTACGCACG CTACGTCCTG 
GCCCTCCTTC TGGGGGTCAA CCTTCTCAAC TACATCGACC GCCAGGTCCT CTATGCCGTC
TTTCCCCTCA TCCAGCACGA TTTCAGCCTC TCCGACACCG CCCTGGGGCT TCTGGGAAGC
GGCTTCATGG TCACCTACAT GGTATCGGCT CCCCTCTTCG GGTGGCTCGG CGACCGCTGG
AGCCGGACCA GGCTCGCCGC GGCGGGCCTC GGGATCTGGA GCGTCGCCAC CGCCGCCGCG
GGCCTTGCCC CCACCTATCC GGCACTGTTG ACTGCCCGCA CCACGGTCGG GGTCGGTGAA
GCGAGCTTCG GCACCGTCTC CCCCGGTCTC CTGGCCGAAT TTTTCGACCG GGAGCGTCGA
GGGCGCATCC TCTCCTACTT CTACCTGGCG ATTCCCGTCG GGAGCGCCCT CGGCTATCTC
CTGGGAGGGG TCATCGGCCA GCAATGGGGA TGGCATGCCG CGTTCATGAT GGTGGGCCTG
CCGGGGCTTC TCCTCGTCCT CCCGGTCTGG CTCATGCGGG AACCGCCCCG CAGCGCCGAT
GCAGCACTAG AGCAGAACGA TAACCCGGAC AACGGCGGCT ACCGCGCCCT GTTCCGGAAC
CGCTCCTTCA TCGCCAACAC CCTGGCCATG GCAGCCATGA CCTTCGCCCT GGGGGGGCTC
GCCCAGTGGA TACCCACCTT CCTCTACCGG GAGCACGGCC TCAGCGTGTC CACCGGTAAC
ACCCTGTTCG GGGGGCTCAC CGTGGTGACC GGCATCTGCG GCACCCTCAC CGGCGGATGG
CTCGGCGACC GCCTCCAGCG CCGCACCCCC AAGGGATACC TCCTGGTCTC GGGCTGGGGG
TTCCTCCTGG GGACGCCGGC GGCGGCCTAC GCCATCCTGA CCCCTTCCCT CAACCACTGC
CTGGGAGGGA TGTTCCTGGC CGAGTTTTTC CTCTTCCTCA ACACCGGCCC CCTCAACACC
GTCATCGTCA ACGTTACCCG CCCGGCCGTC CGCGCCATGG CCTTCGCCGT GAACATCTTC
TTCATCCATG CCCTGGGGGA CGCAATCTCC CCCACCATCC TCGGCCGGCT CTCGGACATC
TGGGGACTCC GCACTGCCCT CCTCTCCACC CCTGTCGCCA TCCTCGTTGC GGCTCTCTTT
GCCTTCGTTT GCTGCCACAG CATCGAAGGG GACATGGCAA AGGCTGAGTA G
 
Protein sequence
MHINAPSPPS KSSPYARYVL ALLLGVNLLN YIDRQVLYAV FPLIQHDFSL SDTALGLLGS 
GFMVTYMVSA PLFGWLGDRW SRTRLAAAGL GIWSVATAAA GLAPTYPALL TARTTVGVGE
ASFGTVSPGL LAEFFDRERR GRILSYFYLA IPVGSALGYL LGGVIGQQWG WHAAFMMVGL
PGLLLVLPVW LMREPPRSAD AALEQNDNPD NGGYRALFRN RSFIANTLAM AAMTFALGGL
AQWIPTFLYR EHGLSVSTGN TLFGGLTVVT GICGTLTGGW LGDRLQRRTP KGYLLVSGWG
FLLGTPAAAY AILTPSLNHC LGGMFLAEFF LFLNTGPLNT VIVNVTRPAV RAMAFAVNIF
FIHALGDAIS PTILGRLSDI WGLRTALLST PVAILVAALF AFVCCHSIEG DMAKAE