Gene RSP_4000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4000 
SymbolproV 
ID3711787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007488 
Strand
Start bp48080 
End bp49168 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content67% 
IMG OID640069338 
Productglycine betaine/L-proline ABC transporter ATPase 
Protein accessionYP_345205 
Protein GI77404631 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAAT CCGCAAGAGA GGACGTCATC CGCTGCGAGG GGATCTGGAA GATCTTCGGC 
CGCAGGTCGC GTCAGGCGAT GGAGGCCGTC CGCAGCGGGG GCCTGTCGAA GACCGAGATC
CGGGAACGGT TCGACTGCGT GGTGGGGGTG CAGGACGCCT CCTTCAGCGT GAAGCGCGGC
GAGATCTTCT GCATCATGGG CCTGTCCGGC TCGGGCAAGT CGACGCTGAT CCGGCACATC
AACCGGCTGA TCGAGCCCAC TTCCGGCTCG GTCTTCATCG AGGGGCAGAA CGTCAATGCG
ATGAACGCCC GCGACCTGCG CGCGCTGCGG GCCCAGCGGA TCGGCATGGT GTTCCAGAGC
ATGGCGTTGA TGCCGCACCG GACGGTGCGC GACAATGTCG TCTTCTCGCT CGAGGTGCGG
GGCCGGCCCG AGGAGGAGCG CGCACGGGTT GCGGCGCAGG CCATCGAGGC GGTGGACCTG
ACGGGATGGG AAACGAAATA TCCCGACGAG CTGTCGGGGG GAATGCAGCA GCGCGTGGGC
TTGGCCCGTG CCATCGCTGC CGACCCGACC ATCCTGCTGA TGGACGAGCC CTTCTCGGCG
CTCGACCCGC TGATCCGCAA GCAGCTTCAG ACCACCTTCA TGGCCCTCTC GGCCGAGCTG
CACAAGACCA CGGTCTTCAT CACCCACGAC CTCGACGAGG CCATCCGCAT CGGTGACCGG
ATCGCGATCA TGAAGGACGG GGTGCTGGTG CAGATCGGCA CGCCCGAAGA GATCGTGACC
GAGCCGGCCG ACGAGTATGT GGCCGATTTC GTGGCCGGGA TCTCGAAGCT CGACCTCGTG
TCGGCGGCGC GCATCATGCA GCCCTTCGAG CAGTATCGCC GGACGCGGCC CACGGACGGG
ATCGAGGCCT GGCCGGTGGC GCGCCCCGAC GACAAGCTGA ACCGGCTCGT CGATCTGGCG
GTCGGCACCG ATCATCCGAT CCTCATCAAG GATGCGGACG CCGTGGTGGG TGTCGTGGGA
AAGCGTGCGC TCCTGCGCGG CATCCAGGGC CGCGAGGACG CGGCCGCTTG CCAGGCGGAG
GCCGTCTGA
 
Protein sequence
MTKSAREDVI RCEGIWKIFG RRSRQAMEAV RSGGLSKTEI RERFDCVVGV QDASFSVKRG 
EIFCIMGLSG SGKSTLIRHI NRLIEPTSGS VFIEGQNVNA MNARDLRALR AQRIGMVFQS
MALMPHRTVR DNVVFSLEVR GRPEEERARV AAQAIEAVDL TGWETKYPDE LSGGMQQRVG
LARAIAADPT ILLMDEPFSA LDPLIRKQLQ TTFMALSAEL HKTTVFITHD LDEAIRIGDR
IAIMKDGVLV QIGTPEEIVT EPADEYVADF VAGISKLDLV SAARIMQPFE QYRRTRPTDG
IEAWPVARPD DKLNRLVDLA VGTDHPILIK DADAVVGVVG KRALLRGIQG REDAAACQAE
AV