Gene Rsph17029_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3787 
Symbol 
ID4898667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp914013 
End bp915014 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content68% 
IMG OID640114391 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001045639 
Protein GI126464526 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.347149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGC GCCTCGCTTC CGGTTGCCTC GCCGTATCCT TCGTAGCCCT CGCGGCTCCC 
GCCTGGGCGG CCGAAGACTG CGGCCGGATC AGCATCGCCG AGATGAACTG GGCCTCGGCC
GGCGTCGCGG CCCAGGTCGA CAAGATCATC CTCGAGGAAG GGTTCGGCTG CACGGTCGAG
CTGGTGGCCG GCGACACGAT GCCCACCTTC ACCTCGATGA ACGAAAAGGG CGAGCCCGAC
ATGGCGCCCG AACTCTGGGT CAATGCCGTG CGCACCCCGC TCGAGGCGGC GGTCGAGGAG
GGGCGCATGG TCGTCGCGGC GCAGATCCTG AAGGACGGCG GCGTCGAGGG CTGGTGGATC
CCGCGTTACC TCGCCGAGGC CCATCCCGAG ATCGACAGTG TCGAGAAGGC GCTGGAGCAT
CCCGATCTCT TCCCCGCCCC CGAGGATGCC TCGCGCGGCG CCGTCCACAA CTGCCCCTCG
GGCTGGAACT GCCAGGTCTC GACCGAGAAC CTGTTCCGGG CGCTCGACGC CGAAGACCAC
GGCTTCGATC TCGTGGACAC GGGCTCGGCC GCGGGGCTCG ACGGCTCGAT CGCCAATGCC
TACGAGCGCG AGGCGGGCTG GCTCGGCTAT TACTGGGCGC CGACGGCCAT CCTCGGCAAA
TACGACATGG TGCGCCTGCC CTTCTCGGTG CCGCACGACA AGGCCGCCTG GGATGCCTGC
ACCGCGGTGC CGGACTGCGC CGACCCCGAG GTGAACTCCT ATCCGGTGTC CGAGGTCTTC
ACCGCCGTCA CCCCCTCCTT CGCCGAGAAG GCAGGTGTGG CCATGGATTA TGTGAAGGCC
CGCAGCTGGA GCAACGAGAC GGTGGGCCAG ATTCTCGCCT GGATGGACGA GAGCCAGGCC
ACCAACGAGG ATGCGGCCTA CCATTTCCTC GAAACCTATC CCGACCTCTG GCGCGCCTGG
CTTCCGGCCG ACGTGGCCGA CCGGGTCGCC GCGGCGCTCT GA
 
Protein sequence
MTKRLASGCL AVSFVALAAP AWAAEDCGRI SIAEMNWASA GVAAQVDKII LEEGFGCTVE 
LVAGDTMPTF TSMNEKGEPD MAPELWVNAV RTPLEAAVEE GRMVVAAQIL KDGGVEGWWI
PRYLAEAHPE IDSVEKALEH PDLFPAPEDA SRGAVHNCPS GWNCQVSTEN LFRALDAEDH
GFDLVDTGSA AGLDGSIANA YEREAGWLGY YWAPTAILGK YDMVRLPFSV PHDKAAWDAC
TAVPDCADPE VNSYPVSEVF TAVTPSFAEK AGVAMDYVKA RSWSNETVGQ ILAWMDESQA
TNEDAAYHFL ETYPDLWRAW LPADVADRVA AAL