Gene Rsph17029_4029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4029 
Symbol 
ID4898630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1175343 
End bp1176545 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content65% 
IMG OID640114632 
ProductABC branched-chain amino acid transporter, periplasmic binding protein 
Protein accessionYP_001045879 
Protein GI126464766 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAAG CCCTTCTTTC TGCGGTGTCG CTGGCGGCCT TGGCCGCGCC TGCGGCAGCC 
CAGGTGTCGG ATGATCTTGT GAAGATCGGC ATCCTGAACG ACCAGTCCGG GGTCTATGCG
GACTTCGGCG GCAAATACAG CTACGAGGCC GCGCTGATGG CGGTCGAGGA TTACGGCGGC
TCGGTCCTCG GCAAGAAGGT CGAGGTCGTG ACGGCCGACC ACCAGAACAA GGCCGACATC
GCCTCGAACA TCGCGCGCCA GTGGTACGAC ACCGAGCAGG TCGACGCGAT CATGGAGCTG
ACCACCTCGT CGGTGGCGCT CGCCGTTCAG GCGCTGTCGC AGGAGAAGAA GAAGGTCACC
ATCACCACCG GGGCGGCCAC GACCGAGCTG ACGGGCAAGC AATGCTCTCC CTATGGCTTC
CACTGGGCCT ATGACACGCA CGCGCTGGCC ATCGGCACCG GCGGCGCGCT GGTCGAACAG
GGCGGCGACA GCTGGTATTT CCTCACTGCC GACTATGCCT TCGGCTATTC TCTCGAAGAG
AATACCGGCG CGGTCGTGAA GGAGAAGGGC GGCCAGGTTC TGGGCGCCGT GCGTCATCCG
CTTTCCACCA CCGACTTCTC CTCGTTCCTG CTTCAGGCGC AGGCCTCGGG CGCCAAGGTC
ATCGGCCTCG CCAATGCGGG TCTCGATACG CAGAATGCCA TCAAGCAGGC GGGCGAGTTT
GGCATCGTGC AGGGCGGCCA GCGGCTTGCA GCGCTCCTCT TCACGCTGGC CGAGGTTCAC
GGGCTCGGCG TCGAATCGGC GCAGGGCCTG ACGCTGACCG AGAGCTTCTA CTGGAACCGC
AACGAGGAAT CGGCCGAGTT CGGCAAGCGG TTCATGGAGC GGACGGGCGC GATGCCGAAC
ATGATCCATG CCGGCACCTA CTCGGCCGTG CTCTCCTATC TCAAGGCGGT CGAGGCCGCG
GGCTCCGACG AGACCGAGGC CGTCTCGGCC AAGCTGCACG AACTGCCGGT CGAGGATGTC
TTCGCCAAGG GCGGCAAAGT TGCACCCAAC GGGCGCATGA TCTCGGACGT CTATCTGCTC
GAGGTGAAGA AGCCCGGCGA GAGCGACGTG CCGTGGGACT ATTACAACGT CCTCGCCACC
ATTCCGGGCG ATCAGGCCTA TCTCGACCCG GCCCAGAGCG GCTGCCCGCT CGTCACGAAT
TGA
 
Protein sequence
MRKALLSAVS LAALAAPAAA QVSDDLVKIG ILNDQSGVYA DFGGKYSYEA ALMAVEDYGG 
SVLGKKVEVV TADHQNKADI ASNIARQWYD TEQVDAIMEL TTSSVALAVQ ALSQEKKKVT
ITTGAATTEL TGKQCSPYGF HWAYDTHALA IGTGGALVEQ GGDSWYFLTA DYAFGYSLEE
NTGAVVKEKG GQVLGAVRHP LSTTDFSSFL LQAQASGAKV IGLANAGLDT QNAIKQAGEF
GIVQGGQRLA ALLFTLAEVH GLGVESAQGL TLTESFYWNR NEESAEFGKR FMERTGAMPN
MIHAGTYSAV LSYLKAVEAA GSDETEAVSA KLHELPVEDV FAKGGKVAPN GRMISDVYLL
EVKKPGESDV PWDYYNVLAT IPGDQAYLDP AQSGCPLVTN