Gene Rsph17029_3645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3645 
Symbol 
ID4898757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp744256 
End bp745269 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content65% 
IMG OID640114253 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001045507 
Protein GI126464394 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.650504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.152004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCA ACCGTCGCCG TTTCGTCGCG ACCGCAGGTG GCGTGCTGCT CGCCGCCCCC 
TTCCTTCGGC CGGGCATGGC CCATGCCGCC GAATATTCCT ACAAGTACGC CAACAACTTC
CCCGTCGGCC ATCCGATGAA CACCCAGATG GAGGCCGCGG CGAAGCGGAT CTCCGAAGAG
ACCGACGGCC GGTTCGAGCT GAAGATCTTC CCGAACAACC AGCTCGGCTC CGACACGGAC
ACGCTGAATC AGGTGCGGTC GGGCGCGGTC GAGTTCTTCA CGCTGTCGGG ACTGATCCTG
TCGTCGCTGG TGCCGGTCGC TTCGATCAAC GGCATGGGCT TTGCTTTCAA GGACATCGAC
CAAGTCTGGC AGGCGATGGA CGGCGAGCTC GGCGCCTATG TGCGCGAGCA GATCCGCGCC
AACCGCCTCG AGGTGATGGA CCGGATCTTC AACAACGGCT TCCGCCAGAT CACAACCTCC
AGCCGCCCGA TCGAGGGGCC GGACGATCTC GCCGGGCTGA AGATCCGCGT GCCGGTCAGC
CCGCTCTGGA CCTCGATGTT CCAGGCGCTC GGCTGCGCCC CGGTCAGCAT CAACTGGAAC
GAGGTCTATA CCTCGCTGCA GACCGGCGTC GTGGATGCGC AGGAGAACCC GCTGTCGACC
ATCGACGTGG GCAAGCTCTA CGAAGTGCAG ACCTACTGCT CGATGACGAA CCACATGTGG
GACGGCTTCT GGATGCTCGC CAACCCGCGG GCCTGGCGGG CGCTGCCCGA CGACCTGCAG
GAGATCGTGG CCCGCAACAT CAACCAGGCC GCGCTCGACC AGCGCGAGGA TCTGACCCAG
CTCAACGCGA CGCTCCAGAG CGAGCTGGAA AAGGACGGGC TGATCTTCAA CAAGCCCGAC
ACCGCCGCCA TCCGCGACAA GCTGCGTGAG GCGGGCTTCT ACAGCGAATG GCGCAGCACC
TACGGCGACG AGGCCTGGGC GCTCCTCGAG CAGGTCTCGG GCTCGCTGGC CTGA
 
Protein sequence
MTINRRRFVA TAGGVLLAAP FLRPGMAHAA EYSYKYANNF PVGHPMNTQM EAAAKRISEE 
TDGRFELKIF PNNQLGSDTD TLNQVRSGAV EFFTLSGLIL SSLVPVASIN GMGFAFKDID
QVWQAMDGEL GAYVREQIRA NRLEVMDRIF NNGFRQITTS SRPIEGPDDL AGLKIRVPVS
PLWTSMFQAL GCAPVSINWN EVYTSLQTGV VDAQENPLST IDVGKLYEVQ TYCSMTNHMW
DGFWMLANPR AWRALPDDLQ EIVARNINQA ALDQREDLTQ LNATLQSELE KDGLIFNKPD
TAAIRDKLRE AGFYSEWRST YGDEAWALLE QVSGSLA