Gene Rsph17025_2830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2830 
Symbol 
ID5085108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2880249 
End bp2881376 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID640484400 
ProductABC transporter related 
Protein accessionYP_001169021 
Protein GI146278862 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.461635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.240559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACC AGTCCGTTCC CACCTTCGAG GACGAGGACA AGGACGAGGT CGCCACCCGC 
CCCGGTCTGG AGATCCGCGG CCTCTACAAG ATCTTTGGCC CCAGCCCGTC GCGCTGGATC
GGCGCGGTGA AGGCCGGGAT GACCAAGACC GACCTCAACC GCCGCCACGG CCATGTGCTG
GGCCTGACCG ACATATCGCT GTCGATCCCG CCGGGGCGGA TCACCGTCAT CATGGGCCTG
TCCGGGTCGG GCAAGTCCAC GCTGATCCGC CACATCAACG GGCTGATCGC TCCCACGGCG
GGCGAGATCC TGTTTGACGG CACCGATGTC TGCCGCATGA GCGCGGCCGA GCTGCGCGGC
TTCCGCCGCA GCCGCACCGC GATGGTGTTC CAGAAGTTCG CGCTCCTGCC GCATCGCACG
GTGCTGGAGA ACACGCGCTA CGGGCTCGAC ATCCGCGGCG TTCCGCGCGC CGAGGCCGAG
AGGGCAGCGC GGCGCTGGAT CGCGCGCGTG GGCCTCGGTG GCTACGAGAA CAGCTATCCG
TCGCAGCTTT CGGGCGGGAT GCAGCAGCGC GTGGGGCTGG CGCGGGCGCT GGCCACCGAC
GCCGAGATCC TGCTGATGGA CGAAGCCTTT TCCGCGCTCG ATCCGCTGAT CCGGCTGGAC
ATGCAGAGGA TCCTGCTGGA GCTGCAGGAG GAGCTGCACC GCACCATCGT CTTCATCACC
CACGACCTCG ACGAGGCGCT GCGGCTGGGC GACCGGATCG CGATCCTGCG CGACGGCCGG
CTGGAGCAGG TGGGCACCGG GCAGGACATC GTGCTGCGGC CCGCGAACGA CTACATCGCC
GCCTTCGTCC ACGAGGTGAA CCGCGCCCGC GTGATCCGTC TCTCGGCCGT GGCGACGCCG
CTGGTTGAGA CCGACGAGGC GCCGCGCCTG GCCCTGCCGG ACCGGCTCGT GCTCGAGGAG
GCCGCGCGCG AGATGCTCGC CGCCGGGGCC GAGCGCGCCC TTGTGGTCGG CCCGCGCCGC
AGGCCGCTGG GGATCGTGCG GATCGGCGAC CTGCTGGCGG GCATGGTCCG CCCGTCCGGC
CATCCCCCGA CGGATCAAGC CCCGACCAAC CAGAGGAGGA AGCAATGA
 
Protein sequence
MTDQSVPTFE DEDKDEVATR PGLEIRGLYK IFGPSPSRWI GAVKAGMTKT DLNRRHGHVL 
GLTDISLSIP PGRITVIMGL SGSGKSTLIR HINGLIAPTA GEILFDGTDV CRMSAAELRG
FRRSRTAMVF QKFALLPHRT VLENTRYGLD IRGVPRAEAE RAARRWIARV GLGGYENSYP
SQLSGGMQQR VGLARALATD AEILLMDEAF SALDPLIRLD MQRILLELQE ELHRTIVFIT
HDLDEALRLG DRIAILRDGR LEQVGTGQDI VLRPANDYIA AFVHEVNRAR VIRLSAVATP
LVETDEAPRL ALPDRLVLEE AAREMLAAGA ERALVVGPRR RPLGIVRIGD LLAGMVRPSG
HPPTDQAPTN QRRKQ