Gene Rsph17029_1945 details


Gene Information

Locus tag: Rsph17029_1945
Symbol:
ID: 4895283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2063605
End bp: 2064885
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 62%
IMG OID: 640112539
Product: ABC branched chain amino acid transporter, substrate binding protein
Protein accession: YP_001043821
Protein GI: 126462707
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03407] urea ABC transporter, urea binding protein
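
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that relationship. It assumes the start and end positions are 1-based and inclusive, and that the gene is an unspliced CDS ending in a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2063605, 2064885)
print(length)                  # 1281
print(protein_length(length))  # 426
```

Both values agree with the Gene Length and Protein Length fields of this record.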


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00107842
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATGT TCCGGGCATA TCTGATCGGC ACGGCCCTCG GTCTGTCGCT CGCGGGCGGG 
GCGCTCGCGC AGGAGGACAC GATCAAGATC GGCGTGCTCC ATTCGCTCTC GGGCACGATG
GCGATTTCCG AGACGACGCT GAAGGACACC GTCCTGATGC TCGTCGATCA GCAGAACGCC
AAGGGCGGCC TTCTGGGCAA GAAGCTCGAG GCGGTGGTGG TGGACCCCGC CTCCGACTGG
CCGCTCTTCG CCGAGAAGGC GCGCGAACTG CTGACCGTGA ACGATGTCGA CGTGATCTTC
GGCTGCTGGA CCTCGGTCAG CCGCAAGTCG GTGCTGCCGG TGATCGAGGA GTTGAACGGC
CTCCTGTTCT ACCCGGTGCA GTATGAGGGC GAGGAGAGCT CGAAGAACGT CTTCTACACC
GGTGCCGCGC CGAACCAGCA GGCGATTCCG GCGGTGGACT ATTTCCTCGA GGAACTGGGC
GTCGAGAAAT TCGCCTTGCT CGGCACCGAC TACGTCTATC CGCGCACGAC GAACAACATC
CTCGAGAGCT ACCTTCAGCA GAAGGGCATC GCGAAATCCG ACATTTTCGT GAACTACACG
CCCTTCGGCC ATTCCGACTG GTCGAAGATC GTGGCGGACG TGAAGGCGCT CGGCGCGGAC
GGCAAGAAGG TGGGCGTGAT CTCGACCATC AACGGGGATG CGAACATCGG CTTCTACAAG
GAACTCGCGG CCGCAGGCAT CTCGGCCGAG GACATTCCCG TCGTGGCCTT CTCGGTGGGC
GAGGAGGAAC TCTCGGGCCT CGACACGTCG AACCTCGTGG GCCATCTCGC GGCCTGGAAC
TACTTCCAGT CCGCCGAAAG CCCCGAGAAC GAAGCCTTCA TCAAGGAATG GAAGGCCCGC
ATGGGTGAGA AGCGGGTGAC GAACGACCCG ATGGAGGCCA CCTACATCGG CTTCAACATG
TGGGTGAATG CCGTAACCGC GGCGGGCACC ACTGATGTGG ATCCGGTGGC CAAGGAGATG
ATCGGGCAGA AATTCCCGAA CCTCACCGGC TCCGAGGCCG AGATGCTGCC GAACCACCAT
CTGACCAAGC CCGTGCTGAT CGGCGAGATC CGCGACGACG GCCAGTTCGA CATCATCTCG
CAGACCGATC CGGTGCCGGG CGATGCCTGG ACGGACTTCC TGCCGGAATC GGCCGTGCTC
GAGTCCGACT GGGCCAAGCT CGACTGCGGC ATGTACAACA CCGAGACCAA GAGCTGCGTG
CAGATCAAGT CGAACTACTG A
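
The GC content field can be recomputed from a sequence listed in this 10-base block layout. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases after whitespace is removed. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, in the same block layout as the record.
first_line = "ATGAAGATGT TCCGGGCATA TCTGATCGGC ACGGCCCTCG GTCTGTCGCT CGCGGGCGGG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives a value to compare against the GC content field.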
 
Protein sequence
MKMFRAYLIG TALGLSLAGG ALAQEDTIKI GVLHSLSGTM AISETTLKDT VLMLVDQQNA 
KGGLLGKKLE AVVVDPASDW PLFAEKAREL LTVNDVDVIF GCWTSVSRKS VLPVIEELNG
LLFYPVQYEG EESSKNVFYT GAAPNQQAIP AVDYFLEELG VEKFALLGTD YVYPRTTNNI
LESYLQQKGI AKSDIFVNYT PFGHSDWSKI VADVKALGAD GKKVGVISTI NGDANIGFYK
ELAAAGISAE DIPVVAFSVG EEELSGLDTS NLVGHLAAWN YFQSAESPEN EAFIKEWKAR
MGEKRVTNDP MEATYIGFNM WVNAVTAAGT TDVDPVAKEM IGQKFPNLTG SEAEMLPNHH
LTKPVLIGEI RDDGQFDIIS QTDPVPGDAW TDFLPESAVL ESDWAKLDCG MYNTETKSCV
QIKSNY
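
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino acid string used for NCBI genetic codes. For elongation codons, translation table 11 assigns the same amino acids as the standard code. As a simplification, alternative start codons are not given special handling, which is adequate here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAGATGT TCCGGGCATA TCTGATCGGC"))  # MKMFRAYLIG
print(translate("CAGATCAAGT CGAACTACTG A"))           # QIKSNY
```

The two calls use the first 30 bases and the final 21 bases of the gene sequence. Their output matches the start and end of the listed protein sequence.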