Gene RSP_4016 details


Gene Information

Locus tag: RSP_4016
Symbol: dctP
ID: 3712045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007488
Strand:
Start bp: 30256
End bp: 31311
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 66%
IMG OID: 640069322
Product: TRAP dicarboxylate family transporter DctP subunit
Protein accession: YP_345189
Protein GI: 77404615
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
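
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 30256
END_BP = 31311
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```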


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.807538
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCAT GGATTCGCCA CAAGAGCGAG CCGCCAACAG AGAGAGAGAC CGAGATGACC 
CAAGCGCAGA CATTCCTCGC CGGAACCGCC CTGGCCCTGC TGGCCGCCCT GCCGGCCTGG
GCCGAAGAGT TCCGCATCGC CGTGGGCGAT GGCGCGGGCG GCACGCAGGA GGCGCTCGGC
AAGGCCTTCG TCGCGGCGCT CGAGAAGGAG TCGGGCGGCG AGATGACGGG CAAGCTGTTC
CTGAACGGCC AGCTCGGCGA CGAGCAGGAC ACGGTGACGG CCGCGGCTAC CGGCACGCTC
GACTTCTCGA TCCTCGCGAT CAACAACATC ACGCCCTTCT CGCCCTCGGT GGGCACGCTG
ACGCTGCCCT ATGTCATCCT GAGCCAGGAG GATGCCGAGA CGGTCACGCA GGGCGAGGTC
GGCCGGCAGA TGATCGAGAA GACGGTCGAG GATGCGGGCG TGCGCATCAT CGGCTGGGGC
TATTCGGGCT TCCGGGTGCT GACCAATTCG AAGAAGCCCG TGGCCTCGGT CGAGGACATG
CAGGGGCTGA TCGTGCGCGT GCCCAAGAAC GAGATCATGA TCGAGACCTA CAAGAGCTGG
GGCATCAACC CCACGCCGAT GGCTTGGGGC GAGACCTTCG CGGCGCTTCA GCAGAAGGTC
GTGGACGGGC AGGACAATCC CTACATGACC GTCTATGCGA TGAAGTTCGA CGAGGTGCAG
AAATATGTCA CCGAGCTGCG CTACATCTTC TCGATCGAGC CGCTGATCGT GAGCGAGGCC
CTGTTCGAGG GGCTGAGCGA GGAGAAGCAG GCGCAGATCC TCGCCGCGGG CGAGGCGGCG
ACGCAGGCCT CCTCGGCCTT CCTGCGCGAG CAGGAGAGCC GGATCCGCGA CGAGCTGGTG
GCGCGCGGCA TGGAGATCAC GCCGCCCGCG GACGGCGAGA AGGGCTTCAT CGAGCTGGCG
ACCGCGCAGG TCTGGCCCAA GTTCGCCGAC CAGATCGGCG GCATCGAGGT GCTGAACGGC
GTCTTGACCT CGCTCGGCCG GCCCACCGTC CAGTAA
 
Protein sequence
MSSWIRHKSE PPTERETEMT QAQTFLAGTA LALLAALPAW AEEFRIAVGD GAGGTQEALG 
KAFVAALEKE SGGEMTGKLF LNGQLGDEQD TVTAAATGTL DFSILAINNI TPFSPSVGTL
TLPYVILSQE DAETVTQGEV GRQMIEKTVE DAGVRIIGWG YSGFRVLTNS KKPVASVEDM
QGLIVRVPKN EIMIETYKSW GINPTPMAWG ETFAALQQKV VDGQDNPYMT VYAMKFDEVQ
KYVTELRYIF SIEPLIVSEA LFEGLSEEKQ AQILAAGEAA TQASSAFLRE QESRIRDELV
ARGMEITPPA DGEKGFIELA TAQVWPKFAD QIGGIEVLNG VLTSLGRPTV Q
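
The protein sequence above can be reproduced from the gene sequence using translation table 11, and the GC content can be recomputed from the same input. The following is a minimal sketch rather than the method used to build this record: it writes out the table 11 codon assignments in the conventional TCAG order, ignores the spaces and line breaks used for display, and stops at the first stop codon. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied with its display spacing.
first_line = "ATGTCGTCAT GGATTCGCCA CAAGAGCGAG CCGCCAACAG AGAGAGAGAC CGAGATGACC"
print(translate(first_line))  # MSSWIRHKSEPPTERETEMT
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values that can be compared with the protein sequence and the GC content field listed in this record.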