Gene Rsph17025_3913 details


Gene Information

Locus tag: Rsph17025_3913
Symbol:
ID: 5085461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 814014
End bp: 815276
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 67%
IMG OID: 640485471
Product: hypothetical protein
Protein accession: YP_001170072
Protein GI: 146279914
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
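
The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the gene length includes one stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(814014, 815276)
print(length)                  # 1263
print(protein_length(length))  # 420
```

Both values match the Gene Length and Protein Length fields of the record.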


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.44156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATA CCACGAAAAA GGGCACCACC CGCCGCGGCT TCCTGGCCGG GGCCGCCGCT 
GGCGGCGCGC TGGCCGCCAC CGCCGCGCGC GCGGGCGGAC CCGACCCGCT GATCACCGAA
GTGCAACCTT GGGCGCAGTC GCTGGGCGAC GGAGTGGACG CCACGCCCTA CGGAATGCCG
ATCCACTTCG AATCCCACGT CGTGCGGCGC AATGTGGAGT GGTTGACCGC GGACACGATC
AGCTCGGTCA ACTTCACCCC GATCCACGAG CTTGACGGCA CGATCACCCC GCAGGGCTGC
GCCTTCGAGC GTCACCACGC CGGCGCCATC GAACTGGCCA AGCCCGACTG GCGGCTGATG
ATCCACGGGC TTGTCGAACG GCCGCTCGTC TTCACCTACG ACGACCTGAT CCGCTTCCCG
CGCGAGAACC ACACATACTT CCTCGAATGC GCGGCGAACG GCGGGATGGA ATGGGCGGGC
GCGCAGCTCA ACGGCTGCCA GTTCACCCAC GGCATGATCC ACAACATGGA ATATACCGGC
ATCCCGCTGC GCACCCTTCT GGCCGAAGCC GGGGTGAAGC CGGACGGCAA ATGGGTTCTG
ATGGAAGGTG CTGACGCTTC CTCGATGAGC CGCTCCATCC CGATCGGGAA GGCGCTCGAT
GACGTGCTGG TGGCCTTCAA GGCCAATGGC GAGGCGCTGC GCAAGGAGCA TGGCTACCCT
GTCCGGCTGG TGGTGCCGGG TTGGGAGGGC AACATGTGGG TCAAGTGGCT GCGCCGCGTC
GAGGTGGGCG ACCAGCCGTG GGAGCACCGC GAGGAAACCT CGAAATACAC CGATACGATG
GCCGACGGCC GCTCGCGCCG CTGGACCTGG GAGATGGACG CCAAGTCGAT CATCACCAGC
CCCAGCCCGC AGGCGCCGAT CACTCACGGC CCCGGGCCCA CGGTGATCTC GGGCCTCGCG
TGGTCCGGAC GCGGCACGAT CCGCGAAGTG CATGTCTCGC TCGACGGCGG CAAGAACTGG
CAGCAGGCGC GCCTTTCGGG GGCGAGCCAC GACAAGGCGC TGCACCGCTT CTACTTCGAG
TTCGACTGGA ACGGCTCGGA ACTGCTGCTG CAATCCCGCG CGGTGGACAG CACCGGCTAC
GTCCAGCCGA CCAAGGAGCA GCTGCGCAGC TTCCGCGGCG TGAACTCGGT CTACCACAAC
AACGGCATCC ACACCTGGTG GGTCAAGGCG AACGGAGAGG CGGAAAATGT CGAGATTTCC
TGA
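
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one behind the record's GC content field. The helper names are hypothetical, and the example runs only on the first two blocks, not on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence only.
print(gc_fraction("ATGACCGATA CCACGAAAAA"))  # 0.4
```

Passing the whole gene sequence to `gc_fraction` would give the figure to compare with the 67% reported in the record.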
 
Protein sequence
MTDTTKKGTT RRGFLAGAAA GGALAATAAR AGGPDPLITE VQPWAQSLGD GVDATPYGMP 
IHFESHVVRR NVEWLTADTI SSVNFTPIHE LDGTITPQGC AFERHHAGAI ELAKPDWRLM
IHGLVERPLV FTYDDLIRFP RENHTYFLEC AANGGMEWAG AQLNGCQFTH GMIHNMEYTG
IPLRTLLAEA GVKPDGKWVL MEGADASSMS RSIPIGKALD DVLVAFKANG EALRKEHGYP
VRLVVPGWEG NMWVKWLRRV EVGDQPWEHR EETSKYTDTM ADGRSRRWTW EMDAKSIITS
PSPQAPITHG PGPTVISGLA WSGRGTIREV HVSLDGGKNW QQARLSGASH DKALHRFYFE
FDWNGSELLL QSRAVDSTGY VQPTKEQLRS FRGVNSVYHN NGIHTWWVKA NGEAENVEIS
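
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs in which codons may act as start codons). The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon table in the usual TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGACCGATA CCACGAAAAA GGGCACCACC"))  # MTDTTKKGTT
# Last three sense codons plus the stop codon.
print(translate("GAGATTTCC TGA"))                     # EIS
```

The two outputs match the first ten and last three residues of the protein sequence listed above.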