Gene RSP_4200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4200 
Symbol 
ID3711893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007490 
Strand
Start bp50383 
End bp51633 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID640069524 
Productmajor facilitator transporter 
Protein accessionYP_345391 
Protein GI77404819 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.279328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACGA CAAGGACGCC GTCCCTCAGC GAGCTCACGG GGCGGCAGGC CAGCTGGATT 
GCCGGCGGGC TGGTGATGAC ACTGGCGACC ATGCCGGGCC AGACCAACTT CATCGCGCAG
TTCAATGCGG TGCTGCGGGC CGAGTTCGGC CTGAGCAGCG GCCTGTTCGG CGGCCTCTAC
ACGCTCGCGA CCCTGACCAG CGCTACGGGG CTGATCTTCG CCGGGGCCCT GGCCGACCGG
ATCGCGCCGC GGAAGCTGGC CTTGGCGATC ATGGCCGGGC TCGCGGCGAC GGCCCTTCTC
ATGTCTCAGG TCCAGAACCT GCCGCTGCTG GTGGTGGCGC TGGCCCTGCT GCGCTTCTTC
GGGCAGGGGA TGCTGATGCA TGTGGCGCTG ACCGCCATGG CGCGCTGGTT CGACAGGTTT
CGCGGGCGGG CCCTGTCCTT CGCGATGTTC GGCATCACGC TGGGGGATTC AATCCTACCC
TTCATGCTGA CCGTCTCGAT CACGGCCTTC GGCTGGCGGA CCGTCTGGAT CGGCACGGCC
TGCACGCTGG CCCTGGCGTT GATGCCGCTG GTGTTCCTCC TGCTGCGCCG CTCGCCGGAA
GGAGGGGCCG TCCCGGCAGG AGGCCCTGCG CCGGCGGCGA CCGGTCTCGA GTGGCGCCGC
GCGCGCGTGC TGCGGGATCC GCTGTTCTGG GCGATCCTGC CGGGCATCAT GGCGATGCCC
GGGATCGGGA CGCTCTTCAT CTTCCATCAG GCCAATCTGG TGGAGGCGAA AGGCTGGGAT
CTGACCACCT TCACCGCCTT CTTCCCCGTT CTGGCGGTGA CGGTTGCGGC CTCGTCGCTC
GCCGCAGGCG TTCTCGTCGA CCGGCTGGGC GCCTGGCGGC TGATGCCCGT CCTGCTCCTG
CCACTTTCGG CCGCCTGCCT CGTGGTGGCG GCCCTGACCC CGGTCTGGTC CATCCCGCTA
ATCTTCCTCG GCTTCGGTCT AACCCAAGGC GTGATGAACC CCGTCATGGG CGCCGTATGG
GTGGAACTCT ACGGCAGCGC CCACATCGGC GCCGTGCGGT CGCTGGCCAC CGCGGCGCTT
GTCGCGGCCT CGGCAATCGG GCCTGGCCTC GCGGGCTGGC TGCTCGACGC CGGCATCCCC
CTTGAGCGGC AGGCGGTGTG CTACGCCGCG TTCTGCCTTG CCTGCACAGC GATCTACGCG
CTCCTCCAAC CGCGGCTCCG CCGACGAACG GTTGCGAGCG CTACTGGCTG A
 
Protein sequence
MMTTRTPSLS ELTGRQASWI AGGLVMTLAT MPGQTNFIAQ FNAVLRAEFG LSSGLFGGLY 
TLATLTSATG LIFAGALADR IAPRKLALAI MAGLAATALL MSQVQNLPLL VVALALLRFF
GQGMLMHVAL TAMARWFDRF RGRALSFAMF GITLGDSILP FMLTVSITAF GWRTVWIGTA
CTLALALMPL VFLLLRRSPE GGAVPAGGPA PAATGLEWRR ARVLRDPLFW AILPGIMAMP
GIGTLFIFHQ ANLVEAKGWD LTTFTAFFPV LAVTVAASSL AAGVLVDRLG AWRLMPVLLL
PLSAACLVVA ALTPVWSIPL IFLGFGLTQG VMNPVMGAVW VELYGSAHIG AVRSLATAAL
VAASAIGPGL AGWLLDAGIP LERQAVCYAA FCLACTAIYA LLQPRLRRRT VASATG