Gene GSU2744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2744 
Symbol 
ID2685993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3019720 
End bp3020901 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID637127435 
Productmajor facilitator family transporter 
Protein accessionNP_953789 
Protein GI39997838 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCAG GCATCTCCGG CAACGTGCTC ATCCTGGGGG TGGTCAGCTT CCTGACCGAT 
GTCTCCAGTG AGATGATCTA TCCGCTACTC CCCCTCTTCC TGACGACTGT GCTCGGTGCC
GGCCCTGCGT TTCTCGGCGT GATCGAAGGA GTGGCCGAAT CGACCGCGGC GCTGCTGAAG
CTCGTCTCCG GCATCGTGTC CGACCGGACC AGAAGCCGCA AGGGGCTGGT TCTGGGGGGG
TACGCCCTGT CGAGCCTGGC ACGTCCCCTG GTGGCTGCGG CAACGGGCCC GGTCGCGGTA
CTGGCCATCC GCTTCGCCGA CCGGGTGGGG AAGGGGGTGC GGACCTCCCC CCGGGATGCT
CTCATCGCCG ACTCTTCCGA TCCCTCGGTC CGGGGGAAGG CCTATGGCTT TCACCGGGCC
ATGGATCACG CCGGAGCCCT CGTGGGCCCC CTGCTCGCCA CCCTGCTCCT GGCCGGCGCC
GCCCTGGACC TGCGGACCGT GTTCTGGCTC TCGGCCGTAC CAGGGCTTCT GGCCGTACTT
CTCATCATTT TGCGGGTCCG CGACGTGGAG CGGAAGCGAA CCAGCGACGG AAGCGTTCTG
GGGGCGATAC CCAGGGATGG GCTCCGCCGG TACCTGGCCG TCCTCGTCCT GTTCACGCTG
GGCAATTCCT CCGACGCCTT CCTGCTGCTG CGGGCGAGCC AGCTCGGGGT TTCGCCGGCG
CGCATCCCGC TGCTGTGGGC CTTTTTTCAC CTGGTGAAGA TGCTGGCGTC CACTCCCTTC
GGCGCCCTGT CGGACCGGAT CGGCCGTCGG CGGGTCATCG TTGCCGGCTG GGCAGTCTAC
GCCCTGTCGT ACCTGGGGTT TGCCGCTGCC GCATCCGAGC CGGCCTGCTG GCTGCTCTTC
GCCGTCTACG GCACCTTCTA CGGGATGACC GAAGGGACCG AAAAGGCGTT CGTCGCCGAT
CTCGTCCCCG CCGAGGCCCG GGGCGGCGCC TTTGGCTGGT ACCATTTCGC CGTGGGCGTC
GGGGCGCTGC CGGCCAGCGT GCTCTTCGGG TTGATCTGGG AGAGGGCCGG GCAGGGAGCG
GCATTTCTGT TCGGCGCGGC GCTGGCAGCC CTGGCGTCAA TCCTGCTGCT CGTCCTGGTG
AGGGAGGACC CGCCGGAGGC GCAGAGCCCG CGGAGCTCCT GA
 
Protein sequence
MFSGISGNVL ILGVVSFLTD VSSEMIYPLL PLFLTTVLGA GPAFLGVIEG VAESTAALLK 
LVSGIVSDRT RSRKGLVLGG YALSSLARPL VAAATGPVAV LAIRFADRVG KGVRTSPRDA
LIADSSDPSV RGKAYGFHRA MDHAGALVGP LLATLLLAGA ALDLRTVFWL SAVPGLLAVL
LIILRVRDVE RKRTSDGSVL GAIPRDGLRR YLAVLVLFTL GNSSDAFLLL RASQLGVSPA
RIPLLWAFFH LVKMLASTPF GALSDRIGRR RVIVAGWAVY ALSYLGFAAA ASEPACWLLF
AVYGTFYGMT EGTEKAFVAD LVPAEARGGA FGWYHFAVGV GALPASVLFG LIWERAGQGA
AFLFGAALAA LASILLLVLV REDPPEAQSP RSS