Gene Rsph17025_3275 details

Gene Information

Locus tag: Rsph17025_3275
Symbol:
ID: 5085766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 145822
End bp: 147444
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 66%
IMG OID: 640484847
Product: hypothetical protein
Protein accession: YP_001169464
Protein GI: 146279306
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
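
The coordinate and length fields in this record can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the gene is a stop codon not counted in Protein Length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 145822
end_bp = 147444
gene_length_bp = 1623
protein_length_aa = 540

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
coding_codons = span_bp // 3 - 1

print(span_bp, coding_codons)  # 1623 540
```

Under these assumptions the span matches Gene Length and the codon count matches Protein Length.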


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.349389
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAC TGCTCGCCTC CACCGCCGTG ATACTGGCGC TTGCCTTGCC CGCGGCCGCG 
CAGGACTACA CGCCCGACCC GAACGCCAGG CCCGGCGGCG CTATCACCAT CACCTACAAG
GACGATGTGG CGACGCTCGA CCCGGCGATC GGTTACGACT GGCAGAACTG GTCGATGATC
AAGTCGATCT TCGACGGGCT GATGGATTAC GTCCCCGGCA CGACCGAGCT GCGCCCCGGC
CTCGCCGAAA GCTACGAGAT CTCGGAGGAC GGGCTCACCT ACACCTTCAA GCTGCGCCCG
GGCGTGACAT TCCACAACGG CCGCGAGATG GTGGCCGAGG ATGTGAAATA TTCGCTCGAT
CGCGTGACCC TGCCCGAGAC GCAATCGCCC GGCGCCGGCT TCTTCGCCTC GATCAAGGGC
TTCGACGCGA TCTCGGACGG CTCGGTCACC ACGCTCGAGG GCGTCACGGT GGTCGATCCC
GCCACTGTGC GGATCGAGCT TTCGCGTCCC GACGCCACCT TCCTGCATGT CATGGCGCTG
AACTTCGCCT CGGTCGTGCC GAAGGAAGCC GTCGAGGCCG CGGGCGCCGA TTTCGGCAAG
CAGCCGGTCG GCACCGGCGC CTTCAAGCTC GGCGAATGGA CGCTCGGCCA GCGGCTCGTC
TTCGAGAAGA ACGCCGACTA CTGGCGCGAC GGCGTGCCCT ATGTCGACAG CATCGTGTTC
GAGGTGGGCC AGGAGCCGAT CGTGGCGCTC CTGCGGCTGC AGAACGGCGA GGTGGACGTG
CCCGGTGACG GCATTCCGCC CGCGAAGTTC ACCGAAGTCA TGGCAGATCC GGCGCAGGCC
GAGCGCGTGG TCGAGGGCGG GCAGCTCCAT ACCGGCTACA TCACGCTGAA CGTGACCCAT
CCGCCCTTTG ACGATCTGAA GGTCCGACAG GCCGTCAACA TGGCGATCAA CAAGCAGCGC
ATCACGCAGA TCATCAACGG CCGCGCCGTG CCCGCGACCC AGCCGCTGCC GCCCTCGATG
CCGGGCTACA CCGAGGGCTA CGAGGGCTAC CCGCACGATG TCGAGAAGGC CAAGGCGCTG
CTCGCCGAGG CGGGTTTTGC CGACGGGTTC GAGACCGAAC TCTATGTGAT GAACACCGAC
CCGAACCCGC GGATCGCGCA GGCCATCCAG CAGGACCTGT CGCAGATCGG CATCAAGGCC
GCGATCCAGA GCCTTGCGCA GGCCAATGTG ATTGAGGCCG GCGGCAACGG CTCGGCGCCG
ATGATCTGGT CGGGCGGCAT GGCCTGGATC GCGGATTTCC CCGATCCGTC CAACTTCTAC
GGCCCGATCC TCGGGTGCGC GGGCGCCGCC GAGGGCGGCT GGAACTGGTC GAAGTTCTGC
GACGAGGCGC TCGACGCCAA GGCCACCGAG GCCGACAGCC TGGCCGATCC GGCCCGCGCC
GAGGAGCGGC TGAAGCTCTG GTCCGACGTC TACATGGGCG TGATGGAGAA GGCGCCGTGG
GTGCCGGTCT TCAACGAGCA GCGCTACACG ATGAAATCCG CGCGCATGGG CGGCGACGAC
AGCCTCTATG TCGATCCCGT CTCGATCCCC GTGAACTACG ACTATGTCTT CGTGACCGAG
TAA
 
Protein sequence
MKRLLASTAV ILALALPAAA QDYTPDPNAR PGGAITITYK DDVATLDPAI GYDWQNWSMI 
KSIFDGLMDY VPGTTELRPG LAESYEISED GLTYTFKLRP GVTFHNGREM VAEDVKYSLD
RVTLPETQSP GAGFFASIKG FDAISDGSVT TLEGVTVVDP ATVRIELSRP DATFLHVMAL
NFASVVPKEA VEAAGADFGK QPVGTGAFKL GEWTLGQRLV FEKNADYWRD GVPYVDSIVF
EVGQEPIVAL LRLQNGEVDV PGDGIPPAKF TEVMADPAQA ERVVEGGQLH TGYITLNVTH
PPFDDLKVRQ AVNMAINKQR ITQIINGRAV PATQPLPPSM PGYTEGYEGY PHDVEKAKAL
LAEAGFADGF ETELYVMNTD PNPRIAQAIQ QDLSQIGIKA AIQSLAQANV IEAGGNGSAP
MIWSGGMAWI ADFPDPSNFY GPILGCAGAA EGGWNWSKFC DEALDAKATE ADSLADPARA
EERLKLWSDV YMGVMEKAPW VPVFNEQRYT MKSARMGGDD SLYVDPVSIP VNYDYVFVTE
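
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, not the method used to produce this record. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino acid ordering shared by tables 1 and 11, and the sketch does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here).

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in this record.
print(translate("ATGAAAAGAC TGCTCGCCTC CACCGCCGTG"))  # MKRLLASTAV
```

Passing the full gene sequence block to `translate` and `gc_fraction` would allow comparison with the Protein sequence block and the GC content field.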