Gene Rsph17029_3541 details


Gene Information

Locus tag: Rsph17029_3541
Symbol:
ID: 4899100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 627975
End bp: 628964
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 65%
IMG OID: 640114150
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_001045404
Protein GI: 126464291
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
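
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the function names are hypothetical and not part of any database API.

```python
# Sketch only: assumes 1-based inclusive coordinates and a terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(627975, 628964)
print(length, protein_length_aa(length))  # 990 329
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.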


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.106923
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGA AAACACTGGT GGCCTCGGCC CTGTGCCTGA TGATGGCGCC TGCCGCCTTC 
GCGCAGGACT ACACCATCCG GCTGTCGCAC GGCGACAACG AGAGCAATCC GACCCACCTG
ACGGCGGTGA AGTTCCAGGA GCTGGTGAAG GAATACACTG AAGGCAAGGC CGAGGTGCAG
ATCTTCCCGA GCAACTCGCT CGGCACCGAA ACCGAGGTGG CGCAGGCGCT GCGGATGGGC
TCCATCGAGG CCGAGATCCT CTATACCGGC AACCTCGTGC CGCTCGCGCC TTCGGCCGGC
GTCCTGATGC TGCCCTACGC CTATACCTCG ACCGAGCAGG CGCACAAGGC GATGGATGCG
CTGATCGATC CGCTGAACGA GCGTCTGACC AAGGAAGCCG GCGTGCGCGC GCTCGGGCTG
ATGGAGAAGG GCTTCCGGGT CCTGACCACC AACAAGCCCG TGACCACGCT CGAGGATCTG
AAGGGCCTCA AGATCCGCGT CTCGCCCAAC GACATCGCGA TCAAGACCTT CCGCGCCTGG
GGGATCGAGC CCCTGCCGAT GGACTGGGCC GAGGTCTTCC CCGCGCTGCA GCAGCGCGTG
ATCGACGGTC AGGAGAACCC CTACACCACG GCCATCTCCT CGCGCTTCTT CGAGGTTCAG
AGCGACATCA CCGAGATCCA CTACATGATG TGGACAGGCC CGCTCCTGAT CAGCGAGCGC
GCCTTCCAGA AATATCCCGA GGATATCCAG CAGGCGCTGC TGCGCGCCGG CCGCGAGGCG
GTGGACTACG GGCGGCAGGT GTCGGCCGAG CTCACCGAAC AGTCGAAGGC CGAGCTGGTG
AAGAACGACA TGACCCTGCA CGGCGCGCCG AAGGACGAGG AGAAGTGGGA AGCGGCGGCC
GCGGCCCTCT GGCCGGAGTT CTACGACCAG ATCGGCGGCG AGGAATGGGC CACGCAGGCC
ATCGAGATCA TCAAGGCCAC CGAGAAGTAA
 
Protein sequence
MLKKTLVASA LCLMMAPAAF AQDYTIRLSH GDNESNPTHL TAVKFQELVK EYTEGKAEVQ 
IFPSNSLGTE TEVAQALRMG SIEAEILYTG NLVPLAPSAG VLMLPYAYTS TEQAHKAMDA
LIDPLNERLT KEAGVRALGL MEKGFRVLTT NKPVTTLEDL KGLKIRVSPN DIAIKTFRAW
GIEPLPMDWA EVFPALQQRV IDGQENPYTT AISSRFFEVQ SDITEIHYMM WTGPLLISER
AFQKYPEDIQ QALLRAGREA VDYGRQVSAE LTEQSKAELV KNDMTLHGAP KDEEKWEAAA
AALWPEFYDQ IGGEEWATQA IEIIKATEK
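
The gene and protein sequences above are displayed in space-separated groups of 10 characters. The sketch below shows one way to strip that grouping, translate the CDS codon by codon, and compute a GC fraction. It is an illustration under stated assumptions, not the method the database used: the helper names are hypothetical, and the amino acid string is the standard assignment that NCBI translation table 11 shares with table 1 (table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Sketch only: helper names are hypothetical, not a database or library API.

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence shown above.
first_row = "ATGCTGAAGA AAACACTGGT GGCCTCGGCC CTGTGCCTGA TGATGGCGCC TGCCGCCTTC"
last_row = "ATCGAGATCA TCAAGGCCAC CGAGAAGTAA"
print(translate(clean_sequence(first_row)))  # MLKKTLVASALCLMMAPAAF
print(translate(clean_sequence(last_row)))   # IEIIKATEK
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.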