Gene GSU0752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0752 
Symbol 
ID2687416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp796200 
End bp797540 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content67% 
IMG OID637125424 
Producttransporter, putative 
Protein accessionNP_951809 
Protein GI39995858 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCCA GTGCTCAGAT GGAAAAGGTC GGGACGCGGC TACGGCTGAT GCTGCGGGCG 
CTGAATTCGC GGAATTACCG GCTGTTTTTC GCCGGACAGA GCGTGTCGCT GGTGGGCACC
TGGATGCAGC AGGTTGCCAT GAGCTGGCTC GTCTACCGGC TGACCGGCTC GGCACTGTTG
CTCGGGGTGG TCGGCTTCGT CAGCCAGATC CCGACCTTCC TCCTGGCGCC GGTGGCCGGG
GTGCTGGCCG ACCGCTGGAA ACGCCGGCCG CTCCTCCTTG CCACCCAGAC CCTGGCCATG
GTCCAGGCGG CGGTGCTGGC GGTCTTCGTA CTGACCGGGA CCACCCCGGT CTGGCTCATC
GTCGCGCTGA GCGCCCTGCT CGGGGTGGTC AACGCCTTCG ACATCCCGAT CCGCCAATCG
TTCGTGGTGG AGCTGGTGGA GAAAAAAGAA GACCTGGGAA ACGCCATCGC CCTCAACTCG
TCCATGGTCA ACGGTGCCCG GCTGATCGGC CCGTCCATTG CCGGAGTGCT GGTCGCCACC
CTGGGCGAGG GGATCTGTTT CCTGATCAAT GCAGCCAGCT ACCTGGCGGT GATCATCGCC
ATAGCGGCGA TGCGGCTCAA GCCGGTGCCG CAGCGGCCCG GCCGCAAGCA TATCCTCCAT
GAACTGCGCG AAGGATTCGG CTACGCCTTC GACTTCAAGC CGATCCGCTA CATCCTGATG
CTCCTCGGCC TGGTCAGCCT GATGGGGATG CCCTACGTGG TGCTGATGCC GATCTTCGCC
AAGGAGGTCC TGCACGGCGG GGCCCACACC TTCGGCTTCC TGATGGCCTC GGTCGGGATC
GGCGCCTTCG GCAGCACCCT CTACCTCGCC TCCCGCACGA GCGTCCTCGG CCTCGGCCGG
GTGATCGCGG TGGCCGCCTG CGTCTTCGGC TTCGGCATCG CCGGTTTCGC CCTGTCCAGC
TCACTGGTCC TCTCGCCCCT GTTCCTCGCC CTGGCCGGCT TCGGCGCTAT GGCCCAGGTT
GCCTCCAGCA ACACCATCCT CCAGACCATC GTCGACGACG ACAAACGGGG CCGGGTGATG
AGTTTCTTCA CCATGTCGTT CATGGGAGCC ACCCCCATCG GCAGCCTGAT GGCCGGGGCC
GTGGCCAACC GGATCGGCGC CCAGAACACC CTGCTGATCG GGGGTGCCGC CTGCCTGCTC
GGCGGGGCGC TCTTCGGCCG GGAGCTGCGT AACCTCCGCC CCCTGGTCCG GCCCATCTAC
GCCCGGCTCG GCATCATTCC CGAAGTGGCG GCCGGCATGC AGGCCGCTGC TGATCTGACG
TGTCCACCGG AAGATCCCTA A
 
Protein sequence
MGASAQMEKV GTRLRLMLRA LNSRNYRLFF AGQSVSLVGT WMQQVAMSWL VYRLTGSALL 
LGVVGFVSQI PTFLLAPVAG VLADRWKRRP LLLATQTLAM VQAAVLAVFV LTGTTPVWLI
VALSALLGVV NAFDIPIRQS FVVELVEKKE DLGNAIALNS SMVNGARLIG PSIAGVLVAT
LGEGICFLIN AASYLAVIIA IAAMRLKPVP QRPGRKHILH ELREGFGYAF DFKPIRYILM
LLGLVSLMGM PYVVLMPIFA KEVLHGGAHT FGFLMASVGI GAFGSTLYLA SRTSVLGLGR
VIAVAACVFG FGIAGFALSS SLVLSPLFLA LAGFGAMAQV ASSNTILQTI VDDDKRGRVM
SFFTMSFMGA TPIGSLMAGA VANRIGAQNT LLIGGAACLL GGALFGRELR NLRPLVRPIY
ARLGIIPEVA AGMQAAADLT CPPEDP