Gene Rsph17025_2044 details


Gene Information

Locus tag: Rsph17025_2044
Symbol:
ID: 5082658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2087597
End bp: 2088580
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 67%
IMG OID: 640483606
Product: TRAP dicarboxylate transporter- DctP subunit
Protein accession: YP_001168240
Protein GI: 146278081
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
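
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
START_BP = 2087597
END_BP = 2088580

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "984 327"
```

Both results agree with the Gene Length and Protein Length fields of the record.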


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.289699
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGG GACGTATCAC TGTCATCGCG GCCTTTGCCG CATCGGTCGC CGCAACCTCG 
GCTTCGGCGC AGACTCACTG GGACATGTCG ATCGTCTGGC CCGAGGGCAA CTTCCACACC
CAGAACGCGA TGGCCTTCGC GGATGCGGTG CGCGAGGCCA CCGACGGCGA GGTGGAGATC
ACGGTCCATT CCGGCGGCGC GCTCGGCATC AAGGGACCGG AGGGCATGGC GGCCGTTCGG
GACGGTCTGG TGCCGATCGC CGAGATCCTG CTCAACCAGC AGGTGGGCGA GGCGCCGGTG
CTGGGCATCG AGACCCTGCC CTTCCTCGCC CCTTCGATGT CCGAGCTGGC GCTCCTGCAC
AAGTTCTACC GGCCCAAGCT GGACGAGGTT GCCGCGGGGA TGAACCAGAA GATCCTCTAC
ATGGTGCCGT GGCCGGGACA GGCCGTCTTC GCGCCCAACC CCATCAACAC GGTCGAGGAT
CTGAAGGGCC TGACCATCCG CGTCGTCGAT TCGAACGGCA ACGACTTCTT CGGCGCGCTC
GGGGCCACCC CGATCCAGAT GCCCTGGGGC GAGGTCGTGC CCTCGCTGGC GGCGGGGACG
ATCAAGGGCG TGACGACCTC CTCCTCCTCG GGGGTGGACG GCGCCTTCTG GGAATTCACC
AAACACATGA GCACCTTCAA CTGGCAGGCC TCGTCGAACA TCATGTCGGT GAACCTCGAT
GCCTGGAACG CGCTGCCCGC CGAGACTCAG GCCGCGATCG AGGAGACCGC GGCCAACCTC
GAGGGCCAGT TCTGGCTCAA CAGCCGCGCC GAGGACGACA AGAAGCTCGC CACCCTGCGC
GAGAACGGCG TCGAGATCAC GGCGCCTTCG CCCGAACTGT CGGCGGCGCT GATGGAAAAG
GCCCTGCCGC TCTGGGAGGC CTTCAAGACC CGCGTGCCCG AGGCCGCGCC GGTGATCGAC
GCCTATCTCA TGCTGCGCGA CTGA
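
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any calculation. The sketch below shows one way to strip the spacing and compute the fraction of G and C bases, which can then be compared with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGAAACAGG GACGTATCAC TGTCATCGCG GCCTTTGCCG CATCGGTCGC CGCAACCTCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 984 bp sequence to the same two functions gives the value for the whole gene.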
 
Protein sequence
MKQGRITVIA AFAASVAATS ASAQTHWDMS IVWPEGNFHT QNAMAFADAV REATDGEVEI 
TVHSGGALGI KGPEGMAAVR DGLVPIAEIL LNQQVGEAPV LGIETLPFLA PSMSELALLH
KFYRPKLDEV AAGMNQKILY MVPWPGQAVF APNPINTVED LKGLTIRVVD SNGNDFFGAL
GATPIQMPWG EVVPSLAAGT IKGVTTSSSS GVDGAFWEFT KHMSTFNWQA SSNIMSVNLD
AWNALPAETQ AAIEETAANL EGQFWLNSRA EDDKKLATLR ENGVEITAPS PELSAALMEK
ALPLWEAFKT RVPEAAPVID AYLMLRD
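
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch in plain Python, assuming the amino acid assignments of NCBI translation table 11, which are the same as the standard code; alternative start codons are not handled, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Stop codons are written as '*'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGAAACAGG GACGTATCAC TGTCATCGCG"))  # prints "MKQGRITVIA"
print(translate("GCCTATCTCA TGCTGCGCGA CTGA"))        # prints "AYLMLRD*"
```

The two outputs match the first and last residues of the protein sequence, with the trailing `*` marking the TGA stop codon.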