Gene Rsph17025_3341 details

Gene Information

Locus tag: Rsph17025_3341
Symbol:
ID: 5085832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 218228
End bp: 219202
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 64%
IMG OID: 640484910
Product: hypothetical protein
Protein accession: YP_001169527
Protein GI: 146279369
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
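
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 218228
end_bp = 219202
reported_gene_length = 975      # bp
reported_protein_length = 324   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print("gene length matches record:", gene_length == reported_gene_length)
print("protein length matches record:", protein_length == reported_protein_length)
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields.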


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.699949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.164593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTGA AATCCCTCAC GCTTGCAGCC CTGCTCGGCG CTGCCGCCGT TCTGCCTGCC 
GCAGCGCAGG AAGTGGTGGT GCGCGTGGCC TACGAGAACA ATCCCGGCGA GCCGACCGAC
CTCGTGATGA ACCGCTGGGC CGAGCTGGTT GCCGAAGCCT CGGACGGCAA CGTGGCGCTC
GAGCTCTATC CCTCGTCGCA GCTGGGCGCC AAGCAGGACG TGATCGAGCA GGGCCTGCTG
GGCGTCAACG TCATCACGAT CGCCGACGTG GGGTTCCTGA CCGACTATGA TCCCGATCTC
GGCATCCTCT TCGGGCCCTA TCTGACCGAC AGCCCCGAGC AGCTCTTCAA GATCTACGAG
AGCGACTGGT TCAAGGAGAA GGAAGCCGCG CTGCGCGAGA AGGGCGTGCA TATCGTCATC
TCGAACTACC TCTACGGCAC CCGGCAGCTT CTGGCGAAGA AGAAGGTCGA GACGCCGGAC
GATCTGGCCG GGATGAAGGT CCGCGTGCCC AACAACATCA TGCAGATCAA GGCGCTCGAA
CTGATGGGTG CCACGCCGAC GCCGATGCCG CTGGGCGATG TCTATCCGGC GCTGACCCAG
GGCGTCATCG ACGGCGTCGA GAACCCGCTG CCGGTGCTCT ATGGCGGCAA GTTCCACGAG
CAGGCCAAGG AGCTGTCGAT GATCAGCTAC CTGACCAACA CCTCGCTCTG GCTGGGCGGC
GAGGCCTATT TCTCGACCCT CGACCCCGAG GTGGTGACCA TGCTGCATGA GACGGGCCAT
CAGGCCGGCC TCTACAGCCA GGAGCTGGCG GCGCAGGAAG AGGGCAAGAT GATCGAAGCG
ATGAAGGCCG CCGGCGTGAC GGTGACCGAG CCCGACGTCG AGGCCTTCCG CGAAAAGACC
AAGGCCTTCT ACACCATGTT CCCGGAATGG TCCGAGGGGC TATACGAGCA GATCCAGGCG
GCTCTCGCCC AGTGA
 
Protein sequence
MTLKSLTLAA LLGAAAVLPA AAQEVVVRVA YENNPGEPTD LVMNRWAELV AEASDGNVAL 
ELYPSSQLGA KQDVIEQGLL GVNVITIADV GFLTDYDPDL GILFGPYLTD SPEQLFKIYE
SDWFKEKEAA LREKGVHIVI SNYLYGTRQL LAKKKVETPD DLAGMKVRVP NNIMQIKALE
LMGATPTPMP LGDVYPALTQ GVIDGVENPL PVLYGGKFHE QAKELSMISY LTNTSLWLGG
EAYFSTLDPE VVTMLHETGH QAGLYSQELA AQEEGKMIEA MKAAGVTVTE PDVEAFREKT
KAFYTMFPEW SEGLYEQIQA ALAQ
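
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be joined into one string and its GC fraction computed is shown below. The helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACATTGA AATCCCTCAC GCTTGCAGCC CTGCTCGGCG CTGCCGCCGT TCTGCCTGCC"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to clean_sequence in the same way would allow the result to be compared against the GC content and Gene Length fields of the record.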