Gene RoseRS_3631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3631 
Symbol 
ID5210609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4539980 
End bp4541152 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content63% 
IMG OID640597224 
Productcytochrome-c peroxidase 
Protein accessionYP_001277936 
Protein GI148657731 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGGTA CTGGCGCCGC CATACTCCGC ATGCAGTTTT GCTTTGTCAT GATGCTACGT 
CCAAAGGTGC ACGCTGTTCG TCTGCCGCTT GCAATTGCCC TGGGTGGTGC AGCGCTGCTG
ATTGTGGCGT TTGTCGGTGG AATACAACGG TTGACGCCGG GCGCCGCAAC GCCAGCATCC
GCCGGTTCGC CTCGCGCGAT GATCGAACTT GGTCGCTGGC TCTTTTATGA CCGGCGCCTC
TCGGCAAATG AACGCATCGC CTGCGCCACC TGTCACCGCC AGGAACTGGG GTTTAGCGAC
GGGCGTGTCG TTTCGATTGG CGCTACCGAC GTACCGCTGC GCCGAAATAC GCCGGGATTG
TTGAATAGCG GTGAATTGAC GGCGCTCACC TGGGCGAATC CACAGGTTCG CACCCTGGAA
CACCAGATAG CGCGCGCGCT CTTCGCCGCC GATCCGCCCG AAATGGGGGT GGCAGGCAAT
GAGCAACGTG TGATCGACAG GCTTCGCGCA GACCCGGATT ATCGCCAGCG ATTCGCTGCT
GCGTTCCCCG CAGATGACGA TCCATTCACC TGGGATCGTG TGATCGAAGC GCTGGCAGCT
TTTACCCGTT CGCTGCACGG GCGGAACACC CCGTATGACC GGTATATTTA CCACGGGGAG
ACCACAGCCC TCACCGAAAG CGCTCGACGC GGCATGGCAC TCTTCTTTTC GCCAGGTCTG
GCATGCGGTC ACTGCCACGT CGATCTGGTT CCACCCAATC GCGCCGCGCC GCCACGCTGG
TCCGACCTGG CGTATGTGGC GACAGGCACC GGGCGTAGCG CCGATCGCGG GCTGGCAGAG
CATACCGGCG CTGCAACCGA TGCATACCGG TTCCGCGTGC CGCCGCTGCG CAACGTGGCA
GTAACGGCGC CCTACATGCA CGACGGCAGC CTGCCAACCC TCGATGCGGT CATCCGCTTC
TACGAATCGG GGGGACAACT GGACGCGGGT TCTGAGCTGG AGCGCCGCGC TGCGCGCCAC
CCGCTCGTTG CCGGTTTTGT GCTGAGCGAC GATGAACGCC GCGACCTGAT CGTATTCCTC
GAATCGCTGA CCGACGCCGA CGCATTGCAA TCGCTGGCAT TTGCCAATCC GTTCAATGGA
CCCCGTCTGT CCATCGCAGA TCCAGGCAGG TAA
 
Protein sequence
MGGTGAAILR MQFCFVMMLR PKVHAVRLPL AIALGGAALL IVAFVGGIQR LTPGAATPAS 
AGSPRAMIEL GRWLFYDRRL SANERIACAT CHRQELGFSD GRVVSIGATD VPLRRNTPGL
LNSGELTALT WANPQVRTLE HQIARALFAA DPPEMGVAGN EQRVIDRLRA DPDYRQRFAA
AFPADDDPFT WDRVIEALAA FTRSLHGRNT PYDRYIYHGE TTALTESARR GMALFFSPGL
ACGHCHVDLV PPNRAAPPRW SDLAYVATGT GRSADRGLAE HTGAATDAYR FRVPPLRNVA
VTAPYMHDGS LPTLDAVIRF YESGGQLDAG SELERRAARH PLVAGFVLSD DERRDLIVFL
ESLTDADALQ SLAFANPFNG PRLSIADPGR