Gene RoseRS_1744 details


Gene Information

Locus tag: RoseRS_1744
Symbol:
ID: 5208701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2156637
End bp: 2158103
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 65%
IMG OID: 640595350
Product: protoporphyrinogen oxidase
Protein accession: YP_001276084
Protein GI: 148655879
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1232] Protoporphyrinogen oxidase
TIGRFAM ID: [TIGR00562] protoporphyrinogen oxidase
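
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon that is not counted
    in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Coordinates taken from the record above.
gene_len, protein_len = feature_lengths(2156637, 2158103)
print(gene_len, protein_len)  # 1467 488
```

Under these assumptions the result agrees with the listed Gene Length (1467 bp) and Protein Length (488 aa).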


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCGA TGTATGCCAC ATCGACGGCG ATGCCGTTCT CGCATGATCA TCCCCACATC 
GTGGTGGTTG GCGGCGGGAT CAGCGGAATG AGCGCAGCGT ATGAACTGGG TCGCGCAGTG
CGCGACGGCG CTGCGCCAGT CGCCGTCACG CTGATCGAGC GCGAGGCGCG TTTAGGCGGC
AAAGTGGTGA CCGAACGGAA CGGCCCCTTC ATCATCGAGG GAGGTCCCGA CTCCTTCATG
GCGCAGAAGC CGTGGGCGGC TGAACTGGCG CGTGAAATCG GGTTGGGCGA TGAGTTGATG
GTCGCCTCGC CGATGCGCCG TACCACCTGG GTGTTGCACA ACGGGTGTCC CCACCCCCTG
CCCGAAGGCA TGCTGCTCAT TGTGCCGACG CGCATCGCTC CTTTCGCTTT CTCGCCACTG
ATTTCTCCGC TCGGTAAACT GCGCATGGCG CTCGACCTGT TCGTGCCAGC GCGTCGTGAC
GATGGCGATG AAACCCTCGC CGACTTCATC CGCCGACGCC TGGGGAATGA GGCGCTCGAC
CGTCTGGCGG AACCGATCCT CTCCGGCATC CACAGTGCAG AATGTGAGCG CCAGAGCATT
ATGGCGACCT TCCCGCGCTT CCGCGAGTTA GAGAAACGGC ACGGCAGCCT GATCCGCGGC
ATGCTTGCCG CGCGTCGCTC AATGCCGCCG TCTTCAGCGC ATCAGTCGCC CTTCATGACC
CTTCGCGGCG GCATGGGTTC GCTGGTCGAG CGGCTGGAAC AGCGCCTGAC GGCGCGTGTC
CTGACCAACC GCAAGGTCAT GGCGCTGGCA TATGATCCGA CTGCTGCGCG TCCATATCGT
CTGCGGCTGG ACGATGGCGC CACGCTGGAT GCCGACGCCG TCATCCTGGC GACCCCATCC
TACACTGCTG CCGATCTGGT GGACGAAGCG TTCCCGGATC TGGCGAGCGC TCTGCGCGCC
ATCCGGTACG TTTCGACGGC GACGATCTCG ATGGTCTACC GACGTAGTGA AGTTGGCACG
CCGCTCGATG GGTATGGGTT GGTGATCCCG CGCAGCGAAC AGACCTGGAT CAACGCCTGC
ACCCTCTCAT CGGTGAAGTT CCGCCACCGC GCCCCCGACG ATTATCTGCT GCTGCGCTGC
TTCGCTGGCG GATCGCGCCG CCCGGAACTG CTCGCGCGGG ACGATGACGA CCTGGTGCGC
CTGGCGCAGT CCGATCTGCG CGCTATCCTG GGTATCACCG CCGCCCCTGT GCTGACGCGC
GTCTATCGCT GGCACAATGG CAACCCGCAG TACGATGTCG GGCATCTCGA TCGCATCGCC
GCGCTCGAAG CGCGTTGTCC AGATGGATTG TTGCTTGCTG GCGCAGCGTA CCGCGGCGTC
GGCGTGCCCG ACTGTATCAA ACAGGGGCGG GACGCTGCGC GTCGGGCGCT GGCGCTGGTG
ACGTCTGTCC AACCGGCGAA AAGTTAA
 
Protein sequence
MTAMYATSTA MPFSHDHPHI VVVGGGISGM SAAYELGRAV RDGAAPVAVT LIEREARLGG 
KVVTERNGPF IIEGGPDSFM AQKPWAAELA REIGLGDELM VASPMRRTTW VLHNGCPHPL
PEGMLLIVPT RIAPFAFSPL ISPLGKLRMA LDLFVPARRD DGDETLADFI RRRLGNEALD
RLAEPILSGI HSAECERQSI MATFPRFREL EKRHGSLIRG MLAARRSMPP SSAHQSPFMT
LRGGMGSLVE RLEQRLTARV LTNRKVMALA YDPTAARPYR LRLDDGATLD ADAVILATPS
YTAADLVDEA FPDLASALRA IRYVSTATIS MVYRRSEVGT PLDGYGLVIP RSEQTWINAC
TLSSVKFRHR APDDYLLLRC FAGGSRRPEL LARDDDDLVR LAQSDLRAIL GITAAPVLTR
VYRWHNGNPQ YDVGHLDRIA ALEARCPDGL LLAGAAYRGV GVPDCIKQGR DAARRALALV
TSVQPAKS
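
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any length or composition check. The following Python sketch shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence rather than the whole gene.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACTGCGA TGTATGCCAC ATCGACGGCG ATGCCGTTCT CGCATGATCA TCCCCACATC"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base fragment only
```

Applying the same two helpers to the full gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields in the Gene Information table.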