Gene Rsph17025_3251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3251 
Symbol 
ID5086000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp119423 
End bp121081 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content69% 
IMG OID640484823 
Producthypothetical protein 
Protein accessionYP_001169440 
Protein GI146279282 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.367816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.217501 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATAA AGATCAAGCT CGCCGCAGCG TTCCTTGCGG TTTTTGTGCT GGCGGGAGCA 
GCTGGTGTGC TCGCCGTGCG CGGGTTCAAC TCGCTCGATG CGCAGCTCGA CGCCATGCTC
GACGGCACCG TTCACGCGGC GATCCAGGCC GATGCGCTCA ATGCCGCGCA GCTGCGGCTG
AAGGCGGCGA TCCGCGAGCA TCTCATCAGC CAGGACGCGG CCACCAAGAA GGCCCGCGAA
GAGGAGATGG CGGTCGCGCG CGCCGAGCAG AAGGAAGCGA TGACGGCGCT CGAGACGGCG
GCTCTCGCGC CCGCTCAGCG CGCGCTGCTC GATGAATACA GCTCGCTCCG CGAGGTGATC
TCGAAGGTGA ACAACGAAGC GGTCGAGTTC TCGAGCCGGA ACGATCTGGC CAATGCCAGC
AGGCTTCTGC TCGCGCCCGA CTATCTGGCG ATGCAGTCCA GGCGGGAAGG GCTGATTGCC
CAGCTTGTCG AGGCCGAGCA GAAGGAACTT GAGGCGTTGC GTCTGGAGGC GGACCGCCAT
ACGCGCGAGG CGCGCCAGAT GCTGATCGGC ATGTTCGCTC TGGCCGGTGT GGTCGGCACC
GCCGCGGCCG TCTGGATCAC GGTCTCGATC AGCCGCGGTC TGCGCAAGGC TCTCGATCTG
TCGCGGCGCG TGGCCGAGGG CGACCTGACC GAGATGGCCG ATGCCCGCGG CCGTGACGAG
ATTGCCGAAC TCCTCCGCTC GAACAATCTC ATGGTCGAAA AGCTGCGCGA GGTGGTGGGC
GGCGTCACGA CCGTGGCGCA GCAGGTCTCG TCCGGCAGCG GCGAGATGGC CTCGACCTCA
GAGCAACTCA GCCAGGGGGC GAGCGAGCAG GCTTCGGCCA CCGAGGAAGC CTCGGCTTCG
GTCGAGCAGA TGGCGGCGAA CATCAAGCAG GCCGCCGACA ATGCGAGTCA GACCGAGCGG
ATGGCGACCA AGGCCGCCGA AGACGCCCGC GCCTCGGGGC AGGCCGTGAC CGAGGCCGTG
GCCGCCATGC GCTCGATCGC CGACAAGATC CTCGTGGTTC AGGAAATCGC CCGCCAGACG
GACCTGCTGG CGCTGAACGC CGCGGTCGAG GCCGCGCGGG CGGGCGAGCA TGGCCGCGGG
TTCGCCGTCG TGGCCTCCGA AGTGCGCAAG CTCGCCGAGC GCAGCCAGAC CGCCGCGGCC
GAGATCTCGT CGCTTTCGAC GGGCACCGTC CGCGCCGCCA CGGGCGCGGG CGAGATGCTG
AACCAGCTTG TGCCCGACAT CGAACATACC TCGCGCCTCG TGACCGACAT CTCGGTGGCC
TCGCGTGAAC TGGCGGCCGG GGCCCAGCAG GTCGCGACGG CGATCCAGCA GCTCGACAAG
GTGACCCAGC AGAACAGCGC AGCCTCGCAA CAGCTTGCGG GTGGTGCTTC CGAACTGTCG
GGCCAGGCCG CGCGGCTCGA GGAGACGGTG CGTTTCTTCA CGTTGAACGA GCAGGCGCTG
GCCAGCGCCC CGGCGCCGCA GCTGCGGGTC GTGCAGGGCG GACGGGTGGA AGCCGCGGCC
GCGCCGCCCC AGCGCAAGGT GGCCTCGGGG GGCTTCAGTT TCAGCCTCGA CGGCACGGAC
GATGAGCTGG ACCGCGCCTT CCACCGTCAG GGCCAATAG
 
Protein sequence
MTIKIKLAAA FLAVFVLAGA AGVLAVRGFN SLDAQLDAML DGTVHAAIQA DALNAAQLRL 
KAAIREHLIS QDAATKKARE EEMAVARAEQ KEAMTALETA ALAPAQRALL DEYSSLREVI
SKVNNEAVEF SSRNDLANAS RLLLAPDYLA MQSRREGLIA QLVEAEQKEL EALRLEADRH
TREARQMLIG MFALAGVVGT AAAVWITVSI SRGLRKALDL SRRVAEGDLT EMADARGRDE
IAELLRSNNL MVEKLREVVG GVTTVAQQVS SGSGEMASTS EQLSQGASEQ ASATEEASAS
VEQMAANIKQ AADNASQTER MATKAAEDAR ASGQAVTEAV AAMRSIADKI LVVQEIARQT
DLLALNAAVE AARAGEHGRG FAVVASEVRK LAERSQTAAA EISSLSTGTV RAATGAGEML
NQLVPDIEHT SRLVTDISVA SRELAAGAQQ VATAIQQLDK VTQQNSAASQ QLAGGASELS
GQAARLEETV RFFTLNEQAL ASAPAPQLRV VQGGRVEAAA APPQRKVASG GFSFSLDGTD
DELDRAFHRQ GQ