Gene RSP_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3059 
Symbol 
ID3721646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp99471 
End bp100472 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content69% 
IMG OID640072736 
ProductABC proline/glycine betaine transporter, periplasmic substrate-binding protein 
Protein accessionYP_354577 
Protein GI77465074 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.78778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAGC GCCTCGCTTC CGGTTGCCTC GCCGTATCCT TCGCAGCCCT CGCGGCTCCC 
GCCTGGGCGG CCGAAGACTG CGGCCGGATC AGCATCGCCG AGATGAACTG GGCCTCGGCC
GGCGTCGCGG CCCAGGTCGA CAAGATCATC CTCGAGGAAG GGTTCGGCTG CACGGTCGAG
CTGGTGGCCG GCGACACGAT GCCCACCTTC ACCTCGATGA ACGAAAAGGG CGAGCCCGAC
ATGGCGCCCG AACTCTGGGT CAATGCCGTG CGCACCCCGC TCGAGGCGGC GGTCGAGGAG
GGGCGCATGG TCGTCGCGGC GCAGATCCTG AAGGACGGCG GCGTCGAGGG CTGGTGGATC
CCGCGTTACC TCGCCGAGGC CCACCCCGAG ATCGACAGTG TCGAGAAGGC GCTGGAGCAT
CCCGATCTCT TCCCCGCCCC CGAGGATGCT TCGCGCGGCG CCGTCCACAA CTGCCCCTCG
GGCTGGAACT GCCAGGTCTC GACCGAGAAC CTGTTCCGGG CGCTCGGCGC CGAAGACCAC
GGCTTCGATC TCGTGGACAC GGGCTCGGCC GCGGGGCTCG ACGGCTCGAT CGCCAATGCC
TACGAGCGCG AGGCGGGCTG GCTCGGCTAT TACTGGGCGC CGACGGCGAT CCTCGGCAAA
TACGACATGG TGCGCCTGCC CTTCTCGGTG CCGCACGACA AGGCCGCCTG GGATGCCTGC
ACCGCGGTGC CGGACTGCGC CGATCCCGAG GTGAACTCCT ATCCGGTGTC CGAGGTCTTC
ACCGCCGTCA CGCCCTCCTT CGCCGAGAAG GCCGGTGTGG CCATGGACTA TGTGAAGGCC
CGCAGCTGGA GCAACGAGAC GGTGGGCCAG ATTCTCGCCT GGATGGACGA GAGCCAGGCC
ACCAACGAGG ATGCGGCCTA CCATTTCCTC GAGACCTATC CCGACCTCTG GCGCGCCTGG
CTTCCGGCCG ACGTGGCCGA CCGGGTCGCC GCGGCGCTCT GA
 
Protein sequence
MTKRLASGCL AVSFAALAAP AWAAEDCGRI SIAEMNWASA GVAAQVDKII LEEGFGCTVE 
LVAGDTMPTF TSMNEKGEPD MAPELWVNAV RTPLEAAVEE GRMVVAAQIL KDGGVEGWWI
PRYLAEAHPE IDSVEKALEH PDLFPAPEDA SRGAVHNCPS GWNCQVSTEN LFRALGAEDH
GFDLVDTGSA AGLDGSIANA YEREAGWLGY YWAPTAILGK YDMVRLPFSV PHDKAAWDAC
TAVPDCADPE VNSYPVSEVF TAVTPSFAEK AGVAMDYVKA RSWSNETVGQ ILAWMDESQA
TNEDAAYHFL ETYPDLWRAW LPADVADRVA AAL