Gene Rsph17025_2828 details


Gene Information

Locus tag: Rsph17025_2828
Symbol:
ID: 5085106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2878268
End bp: 2879269
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 70%
IMG OID: 640484398
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_001169019
Protein GI: 146278860
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID:
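
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic under two assumptions that the record does not state explicitly: coordinates are 1-based and inclusive, and the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 2878268
END_BP = 2879269


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")                  # 1002 bp
    print(protein_length(length_bp), "aa")  # 333 aa
```

Both results agree with the Gene Length and Protein Length fields of this record.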


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.23634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAAGC GCATCGCTTC CGCCTGGCTT GCCGCGTCGC TTGCTGCCCT CGCCGCTCCG 
GCTTGGGCGC AGGAGACGTG CGGCCGGATC AGCATCGCCG AGATGAACTG GGCCTCGGCC
GGGGTGGCGG CGCAGGTGGA CCGGATCATC CTCGAGGAAG CCTTCGGCTG CGACGTGGAG
CTGGTCACCG GCGACACGAT GCCGACCTTC ACCTCGATGA ACGAGAAGGG CGAGCCGGAC
ATGGCGCCCG AGATGTGGGT GAACGCGGTC CGCGCCCCGC TCGACGCCGC CGTGGAGGAG
GGCCGGCTGG TGATCGCGGC GCCCATCCTC GAGGAGGGCG GCATCGAGGG CTGGTGGATC
CCGCGCTATC TGGCCGAGGC CCATCCCGAG ATCGACAGCG TCAAGGCGGC GCTCGCCCGT
CCCGAACTGT TTCCCGCGCC CGAGGATCCC TCGGTCGGCG CCGTCCACAA CTGCCCGCCC
GGCTGGAACT GCCAGATCTC GACCGAGAAC CTCTTCCGGG CGCTCGATGC CGAGAGCCGC
GGCTTCACGC TCGTTGATAC CGGCTCTTCG GCGGGGCTCG ACGGCTCGAT CGCCAATGCC
TACGAGCGCA GGGCCGGCTG GTTCGGCTAC TACTGGGCGC CGACGGCGAT CCTCGGCAAG
TATGACATGG TGCGCCTGCC GTTTTCGGTG CCCCACGACA AGGCCGAATG GGACAGCTGC
ACCGCGGTGC CCGACTGCGC CGAGCCCAGT GTGAATGCCT ATCCGGTGTC CGAGGTCTTT
ACCGTCGTCA CCCCCGCCTT TGCCGAGAAG GCCGGCGTGG CCATGGACTA TGTCGGCGCG
CGCCGATGGA GCAACGGCAC CGTGGGCGCG GTGCTGGCCT GGATGGACGA GAACCAGGCC
ACGAACGAGG AGGCGGCGCG TCACTTCCTC GAAACCTACC CCGAGTTGTG GCGTGCCTGG
CTTCCGGCCG AGGCCGCCGA CCGGGTCGCC GCGGCGCTCT GA
 
Protein sequence
MTKRIASAWL AASLAALAAP AWAQETCGRI SIAEMNWASA GVAAQVDRII LEEAFGCDVE 
LVTGDTMPTF TSMNEKGEPD MAPEMWVNAV RAPLDAAVEE GRLVIAAPIL EEGGIEGWWI
PRYLAEAHPE IDSVKAALAR PELFPAPEDP SVGAVHNCPP GWNCQISTEN LFRALDAESR
GFTLVDTGSS AGLDGSIANA YERRAGWFGY YWAPTAILGK YDMVRLPFSV PHDKAEWDSC
TAVPDCAEPS VNAYPVSEVF TVVTPAFAEK AGVAMDYVGA RRWSNGTVGA VLAWMDENQA
TNEEAARHFL ETYPELWRAW LPAEAADRVA AAL
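
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it assumes the sequence is pasted as plain text with the 10-base spacing shown above, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons it permits, and that is not modelled here). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# amino-acid assignments. Alternative start codons are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    bases = clean(seq)
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    fragment = "ATGACGAAGC GCATCGCTTC CGCCTGGCTT"
    print(translate(fragment))  # MTKRIASAWL
    print(f"GC fraction of fragment: {gc_fraction(fragment):.2f}")
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field in this record.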