Gene Rsph17029_2138 details

Gene Information

Locus tag: Rsph17029_2138
Symbol:
ID: 4895223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2265513
End bp: 2266490
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 63%
IMG OID: 640112732
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_001044013
Protein GI: 126462899
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
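
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2265513
end_bp = 2266490
gene_length_bp = 978
protein_length_aa = 325

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons = span_bp // 3
coded_residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons minus stop matches protein length:", coded_residues == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span is 978 bp, which is 326 codons, or 325 residues plus the stop codon.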


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.231468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.866818
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA CAATGATCGC GACCCTGCTG GCGAGCGCCG CGCTCGCGGC ACCCGCCTTC 
GCCGAGTGCG AGGTGACGCT GCGGTCTTCG GACACGCACC CGGATGGCTA TCCGACCGTC
GAGGGCGTCA AGTTCATGGC CGAGCGCGCC AAGGAACTGT CGAACGGGCG CATCTGCATC
GAGGTCTTCC CCTCGTCGCA GCTCGGCGAA GAGAAGGACA CGATCGAGCA GACCCAGTTC
GGCGTGATCG ACATGGTGCG CGCCTCGTTC GGCTCGTTCA ACGACATCGT GCCCGAGGCG
CAGCTCCTGT CGCTGCCCTA CCTCTTCCGC TCGGAAGAGC ATCTGCACAA TGTGATGGAC
GGCCCGATCG GCGACGAGCT CGCCAAGGCC TTCGAGGCCA AGGACCTGAT CGCGGTGGCC
TACTATGACG GTGGCTCGCG CAGCTTCTAC AACAGCCAGA AGCCGATCAC CAAGGTCGAG
GACCTCAAGG GCATGAAGTT CCGCGTCATG CAATCGGACG TGTTCGTGGA CATGATGTCC
GCGCTCGGCG CCAATGCGAC GCCGATGCCC TACGGCGAGG TCTATTCCTC GATCCAGACC
GGCGTCATCG ACGGGGCCGA GAACAACTGG CCGTCCTACG ACAGCTCGGG CCATTTCGAG
GTGGCGAAAT ACTACACGCT CGACCAGCAT CTGATGGTGC CCGAGCTGGT GGCGATCTCG
AAGATCAAGT GGGACGCGCT CTCGCCCGAG GACCAGCAGG TGCTGCGTCA GGCGGCCGAA
GAGTCCGAGC CCGTGCAGCG CAAGCTCTGG GCCGAGCAGG AGAAGGCCTC GGAAGAGAAG
GTCGTGGCCT CCGGCGCTGA GGTCGTGCGC GAGATCGACA AGACCCCCTT CATCGAGGCG
ATGGCTCCGG TCTACGAGAA ATACGTGACC AAGTCGGAAT ATCAGGATCT CGTGAAGCGG
ATCCAGGAAA CCCAGTGA
 
Protein sequence
MKKTMIATLL ASAALAAPAF AECEVTLRSS DTHPDGYPTV EGVKFMAERA KELSNGRICI 
EVFPSSQLGE EKDTIEQTQF GVIDMVRASF GSFNDIVPEA QLLSLPYLFR SEEHLHNVMD
GPIGDELAKA FEAKDLIAVA YYDGGSRSFY NSQKPITKVE DLKGMKFRVM QSDVFVDMMS
ALGANATPMP YGEVYSSIQT GVIDGAENNW PSYDSSGHFE VAKYYTLDQH LMVPELVAIS
KIKWDALSPE DQQVLRQAAE ESEPVQRKLW AEQEKASEEK VVASGAEVVR EIDKTPFIEA
MAPVYEKYVT KSEYQDLVKR IQETQ
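
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a hedged sketch rather than the method used to build the record. It uses only the Python standard library, relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code, and ignores alternative start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAAAA CAATGATCGC GACCCTGCTG GCGAGCGCCG CGCTCGCGGC ACCCGCCTTC"
last_line = "ATCCAGGAAA CCCAGTGA"

print(translate(first_line))  # MKKTMIATLLASAALAAPAF
print(translate(last_line))   # IQETQ
```

The two printed fragments correspond to the first twenty and last five residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the reported GC content to be compared in the same way.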