Gene Rcas_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2024 
Symbol 
ID5539502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2596402 
End bp2597874 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content65% 
IMG OID640894159 
Productprotoporphyrinogen oxidase 
Protein accessionYP_001432130 
Protein GI156742001 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR00562] protoporphyrinogen oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.364182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA TGCATTCGAC ATCGGCGGCA ACGCTGTTCC CGGGCGGTCA ACCGCACATC 
GTTGTGGTAG GCGGCGGGAT CAGCGGCATG AGCGCGGCAT ATGAACTGGG GCGCGCAACG
CGCGACGGGG CGCCGCCGGT GATGGTCACG CTCATCGAGC GTGAGGCGCG TTTAGGGGGC
AAGGTCGTCA CCGAGCGCAA CGGACCCTTC GTCATCGAAG GCGGACCCGA CTCGTTCATG
GCGCAGAAAC CATGGGCTGC CGAACTGGCG CGTGAGATCG GTCTGGGTGA CGAGTTGATG
GTCGCCTCGC CGATGCGCCG CACGACATGG GTGCTGATCC GTGGACGACC GCAACCGCTC
CCCGAAGGCA TGCTCCTGAT CGTCCCCACA CGCATCGCAC CCTTTGCCTT CTCGCCGCTG
ATTTCTCCCC TTGGAAAACT TCGTATGGCG CTCGACTTGT TCGTTCCGGC GCGCCGTGAC
GATGGCGATG AGACGCTCGC CGACTTTATT CGCCGTCGCC TGGGGAATGA GGCGCTTGAT
CGTCTGGCGG AGCCGATCCT CTCCGGCATT CACAGCGCTG AGTGCGAACG CCAGAGCATT
CTGGCGACCT TTCCGCGCTT CCGCGAGTTG GAGAAACGCC ATGGCAGTCT GATCCGCGGC
ATGCTTGCAG CGCGGCGCAC CGCGTCACCC TCTTCAGCGC ATCAGTCGCC CTTCATGACG
CTGCGCGGCG GCATGGGGAC GCTCGTCGAG CGGTTGGAAC AACGTCTCAC GGCGCGTATC
CTGACCAACC GCCGGGTGAT GGCGCTCACC TGTGATACAA CCGCTGCGCG TCCCTATCGT
CTGTGGTTGG ACGACGGCGC CACGCTGGAT GCCGATGCCG TCATTCTGGC GACGCCATCC
TACGCCGCTG CTGACCTCGT CGGTGCATCG TTCCCGGCGC TGGCGGATGC GTTACGCGCC
ATCCGGTACG TTTCGACCGC CACGGTCTCA CTGGTCTACC GGCGCAGTGA GGTCGGGACG
CCGCTCGATG GCTATGGTCT GGTCATTCCG CGCAGCGAAC AGACCTGGAT TAATGCATGC
ACCCTCTCCT CGGTTAAGTT TCGCCATCGC GCGCCCGATG AGTATCTGTT GCTGCGCTGC
TTCGTCGGCG GATCGCGTCG TCCAGAACTG CTGGCGCGGG ACGATGACGA CCTGGTGCGC
ATGGCGCAGT CCGATCTGCG CGCCGTTCTG GGCATCACCG CCGTGCCGCT GCTGACGCGC
GTGTATCGCT GGCATAACGG CAACCCGCAG TATGATGTCG GGCATCTGGA ACGAATCGCC
GCGCTCGAGG CGCTTTGTCC GGCGGGTCTT TTGCTGGCCG GCGCCGCGTA TCGTGGCGTT
GGCGTGCCCG ACTGCATCAA ACAGGGGCGT GAGGCGGCGC GTCGGGCGCT CGATGTGGTT
GCGACCGCTC GCTATCCGGT GATGGAAAAG TAA
 
Protein sequence
MTAMHSTSAA TLFPGGQPHI VVVGGGISGM SAAYELGRAT RDGAPPVMVT LIEREARLGG 
KVVTERNGPF VIEGGPDSFM AQKPWAAELA REIGLGDELM VASPMRRTTW VLIRGRPQPL
PEGMLLIVPT RIAPFAFSPL ISPLGKLRMA LDLFVPARRD DGDETLADFI RRRLGNEALD
RLAEPILSGI HSAECERQSI LATFPRFREL EKRHGSLIRG MLAARRTASP SSAHQSPFMT
LRGGMGTLVE RLEQRLTARI LTNRRVMALT CDTTAARPYR LWLDDGATLD ADAVILATPS
YAAADLVGAS FPALADALRA IRYVSTATVS LVYRRSEVGT PLDGYGLVIP RSEQTWINAC
TLSSVKFRHR APDEYLLLRC FVGGSRRPEL LARDDDDLVR MAQSDLRAVL GITAVPLLTR
VYRWHNGNPQ YDVGHLERIA ALEALCPAGL LLAGAAYRGV GVPDCIKQGR EAARRALDVV
ATARYPVMEK