Gene Rsph17029_3398 details

Gene Information

Locus tag: Rsph17029_3398
Symbol:
ID: 4898289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 452064
End bp: 453059
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 64%
IMG OID: 640113995
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_001045263
Protein GI: 126464150
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
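
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; neither is stated in the record, although its numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 452064
END_BP = 453059


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 996 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.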


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.586433
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAACTGA TCCACGGCCT TCTGGCCGCC GCCCTTCTTG CGACTGGCGC ACAGGCGCAG 
GACTACAGCA GCCGCACCAT CAAGTTCGCC GCCACCGGTC AGGAAGGTAC GCCGCCGGTG
CAGGGCATGC ATATCTTCGC GCAGAAGCTC GAGGAGCAGA GCGGCGGCAA GCTGAAGACG
CGCGTCTTCG CCAATGGCGT GCTGGGCGGC GATGTGCAGG TGCTGTCGTC GCTTCAGGGC
GGCGTGGTCG AGATGATGGT CTGGAACGCC GGCAACATGA TGACCCAGGC GCAGGATTTC
GGCATCCTCG ATCTGCCCTT CATCTATCAG GACGAAGAGG TGATGGATAC GCTGCTCGAC
GGCGAAGTCG GCAGGAAGCT CACCGATCAG CTGCCCGAGC ATGGCGTGAT CGGCCTGTCC
TTCTGGGAAC AGGGCTTCCG CCAGCTGACC AACGACACCC GCGAGGTGCA CAGGCTCGAG
GATATTGCGG GCCTCAAGGT CCGCGTGCAG CAGAACCCGC TGCTCGTCGA CATGTGGAAG
GCGCTTGGCG CCAATCCCAC GCCGATGGCG GTGACCGAAC TCTACACCGC GCTCGAGACC
GGCGCCGTGG ACGGGCAGGA ATGCACCGCG CCCTTCGCTC TCACCGCGAA ATATACCGAG
GTGCAGAAAT ATCTCTCGGT CACCCGCCAC AACTACAATC CGCAGATCGT GCTGATCGGC
AAACCCTTCT GGGACAAGCT CACCGACGAT GAAAAGGCCC TGATCCAGAA GGTCGCGCAG
GAGACTGCGG TCGAACAGCG CCGCATTTCG CGCGCGGCGC AGGACAGCGC GCTGGAGGAG
ATCCGGGCGG CTGGCAATGT CGTGACCGAG ATCACCCCCG AAGAGCTCGC CCGCATGCAG
GAGGCCGTCG CCCCGGTCAT CCGCACCTAT GCACAGACCT TCGATCCCGA GCTCGTGCGC
ACCGTCTTCG ATGCGGTCGG CTTCTCGCTG GATTGA
 
Protein sequence
MKLIHGLLAA ALLATGAQAQ DYSSRTIKFA ATGQEGTPPV QGMHIFAQKL EEQSGGKLKT 
RVFANGVLGG DVQVLSSLQG GVVEMMVWNA GNMMTQAQDF GILDLPFIYQ DEEVMDTLLD
GEVGRKLTDQ LPEHGVIGLS FWEQGFRQLT NDTREVHRLE DIAGLKVRVQ QNPLLVDMWK
ALGANPTPMA VTELYTALET GAVDGQECTA PFALTAKYTE VQKYLSVTRH NYNPQIVLIG
KPFWDKLTDD EKALIQKVAQ ETAVEQRRIS RAAQDSALEE IRAAGNVVTE ITPEELARMQ
EAVAPVIRTY AQTFDPELVR TVFDAVGFSL D
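
The sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for internal codons; the table's alternative start codons are not handled, which does not matter here because the gene begins with ATG. All names in the code are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI's TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGAAACTGA TCCACGGCCT TCTGGCCGCC GCCCTTCTTG CGACTGGCGC ACAGGCGCAG"
print(translate(first_line))  # prints: MKLIHGLLAAALLATGAQAQ
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.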