Gene Rsph17029_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0040 
Symbol 
ID4895081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp49785 
End bp51023 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content68% 
IMG OID640110616 
Productmajor facilitator transporter 
Protein accessionYP_001041932 
Protein GI126460818 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAT CTCCGATCTT CACGCCCGTC CTGATCTCGG GCTGCATCGT CCTGATGCTG 
GGCTTTGCGA TCCGCGCCAG CTTCGGCGTG TTCCAGATCC CCATCGCCGA GGAGTTCGAC
TGGCCGCGGT CCGACTTCTC GATGGCCATC GCGATCCAGA ACCTCGCCTG GGGCATCGGC
CAGCCGATCT TCGGGATGCT GGCCGAGAAG TTCGGCGACC GCCGGGCCAT CGTCGCGGGC
GCGCTCACCT ATGCGGCGGG TCTCGTGCTC TCGAGCTTCG CCGTGACGCC GCTCCAGCAT
CAGTTCCTCG AGGTGCTGGT GGGGTTCGGG ATCGCGGGCA CGGGCTTCGG CGTGATCCTT
GCGGTGGTGG GGCGGGCCAC GGCGCCTGAG CATCGCTCGC TGGCGCTCGG CATCGCCACG
GCTGCGGGGT CGGCGGGTCA GGTCTTCGGG GCGCCCGCGG CCGAGATCCT GCTGGGCTTC
TACAGCTGGC AGACAGTGTT CGTGATCTTC GCGGGCGTCA TCCTTGCCGC GCTCTTTGCG
CTGCCCTTCA TGCGTGCGCC GGTCACCGCG ACGAAGGCCG AGCTCGAGGA GTCGCTCGGC
ACGGTGCTCA GACGGGCCTT CCGCGATCCG TCCTATACGC TGATCTTCGT GGGCTTCTTC
TCCTGCGGCT ATCAGCTGGC CTTCATCACC GCGCACTTCC CCGCCTTCGT GACGGAGATG
TGCGGGGCGA TCGATCCGCG CGGGCCGCTG GCGGCGCTGG GGATCACCAC CACCTCGGCG
CTGGGCGCAC TGGCGATCTC GCTGATCGGG CTGGCCAACA TCGCGGGCAC GATCACCGCA
GGCTGGCTCG GCAAGCGCTA CTCGAAGAAA TACCTGCTGG CCGCGATCTA TACCGGGCGC
ACGCTTGCGG CCGCGCTCTT CATCCTCGTG CCGATGACGC CCACCACGGT CCTTCTCTTC
TCCCTCAGCA TGGGCGCGCT GTGGCTGGCG ACCGTGCCGC TCACGAGCGG GCTCGTGGCC
CATCTCTACG GCCTGCGCTA CATGGGCACG CTCTACGGGT TCGTCTTCCT CAGCCATCAG
CTCGGCAGCT TCATGGGCGT CTGGCTGGGC GGGCGGATGT ATGACATGAC CGGCGACTAT
ACGATGGTCT GGTGGATCGG CGTGGGCGTC GGCGCCTTCT CGGCCATCGT CCACCTGCCC
ATCCGCGAGA CCCGCAGCCC CGCGTTGCAG CCGGCCTGA
 
Protein sequence
MTKSPIFTPV LISGCIVLML GFAIRASFGV FQIPIAEEFD WPRSDFSMAI AIQNLAWGIG 
QPIFGMLAEK FGDRRAIVAG ALTYAAGLVL SSFAVTPLQH QFLEVLVGFG IAGTGFGVIL
AVVGRATAPE HRSLALGIAT AAGSAGQVFG APAAEILLGF YSWQTVFVIF AGVILAALFA
LPFMRAPVTA TKAELEESLG TVLRRAFRDP SYTLIFVGFF SCGYQLAFIT AHFPAFVTEM
CGAIDPRGPL AALGITTTSA LGALAISLIG LANIAGTITA GWLGKRYSKK YLLAAIYTGR
TLAAALFILV PMTPTTVLLF SLSMGALWLA TVPLTSGLVA HLYGLRYMGT LYGFVFLSHQ
LGSFMGVWLG GRMYDMTGDY TMVWWIGVGV GAFSAIVHLP IRETRSPALQ PA