Gene Rsph17029_2569 details

Gene Information

Locus tag: Rsph17029_2569
Symbol:
ID: 4895814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2706730
End bp: 2707734
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 66%
IMG OID: 640113168
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_001044443
Protein GI: 126463329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
    [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
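
The start, end, gene length, and protein length values in this record are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2706730
end_bp = 2707734

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1005 334
```

Under these assumptions the results match the reported 1005 bp and 334 aa.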


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.282286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.691286
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGACCC GTCGCAGCCT CGCCGCTCTG GCAGGCGCCG CCGCGCTGGC GCTCGCCGCC 
GCCGTGCCGG CTCTCGCCCA GCCGATCGTC ATCAAGTTCA GCCACGTCGT CGCCCCCGAC
ACGCCGAAGG GCAAGGGCGC CACGAAGTTC GAGGAACTGG CGGAGAAATA CACCGACGGC
GCGGTGGATG TCGAAGTCTA CCCCAACAGC CAGCTCTACA AGGACAAGGA AGAGCTCGAG
GCGCTGCAGC TCGGCGCGGT CCAGATGCTC GCCCCGTCGC TGGCCAAGTT CGGCCCGCTC
GGCGTGCAGG ATTTCGAGGT CTTCGACCTG CCCTACATCT TCAAGGGCTA TGACGCGCTG
CACACCGTGA CCAACGGCGA GGTGGGCAAG ATGCTGTTCT CGAAGCTCGA GGACAAGGGC
ATCAAGGGCC TCGCCTACTG GGACAACGGC TTCAAGATCA TGTCGGCCAA CAGCCCGATC
GCCACGCCCG ACGACTTCCT CGGGCTGAAG ATGCGCATCC AGTCCTCGAA GGTGCTCGAG
GCGCAGATGA ACGCGCTCGG CGCGGTGCCG CAGGTCATGG CCTTCTCCGA GGTCTATCAG
GCGCTGCAGA CCGGCGTCGT GGACGGCACC GAGAACCCGC CCTCGAACAT GTATACCCAG
AAGATGCACG AGGTGCAGAA GCACGCCACG GTCTCGAACC ACGGCTACCT CGGCTATGCG
GTGATCGTGA ACAAGCAGTT CTGGGACGGC CTGCCCGAAG AGGTGCGCGC CGGGCTCGAG
AAGGCGCTGA CCGAGGCCAC CGACTATGCC AACGGCATCG CCAAGGAAGA GAACGACAAG
GCGCTGCAGG CGATGAAGGA CGCGGGCACG ACCGAGTTCC ACGAGCTGAC CCCCGAAGAG
CTCGCGGCCT GGGAAGAGGT GCTCGCCCCC GTCCATGAGG AAATGGCCGG CCGCATCGGC
GCCGAGACCA TCGCCGCCGT GAAGGCCGCG ACCGGGACCA ACTGA
 
Protein sequence
MLTRRSLAAL AGAAALALAA AVPALAQPIV IKFSHVVAPD TPKGKGATKF EELAEKYTDG 
AVDVEVYPNS QLYKDKEELE ALQLGAVQML APSLAKFGPL GVQDFEVFDL PYIFKGYDAL
HTVTNGEVGK MLFSKLEDKG IKGLAYWDNG FKIMSANSPI ATPDDFLGLK MRIQSSKVLE
AQMNALGAVP QVMAFSEVYQ ALQTGVVDGT ENPPSNMYTQ KMHEVQKHAT VSNHGYLGYA
VIVNKQFWDG LPEEVRAGLE KALTEATDYA NGIAKEENDK ALQAMKDAGT TEFHELTPEE
LAAWEEVLAP VHEEMAGRIG AETIAAVKAA TGTN
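
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a rough sketch of that cleanup together with a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used as example input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as example input.
first_line = "ATGTTGACCC GTCGCAGCCT CGCCGCTCTG GCAGGCGCCG CCGCGCTGGC GCTCGCCGCC"
seq = clean_sequence(first_line)

print(len(seq))                    # six blocks of ten bases
print(f"{gc_fraction(seq):.2f}")   # GC fraction of this first line only
```

Passing the full gene sequence through the same two helpers gives values that can be compared against the Gene Length and GC content fields of the record.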