Gene Rcas_0505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0505 
Symbol 
ID5537968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp653563 
End bp654660 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content62% 
IMG OID640892667 
Productcytochrome-c peroxidase 
Protein accessionYP_001430653 
Protein GI156740524 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0442639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT TCGTCCATTT TTTTAGCCTG GCGCTATTGA TTGCTGTGTT CATCACAGTG 
ATACGCGGAG ATTCGCCTAT TGCCAGAACG CCAGCCCCCG GCGAGTCGCC GCAGGCGATG
ATTGCATTGG GACGCCGGCT CTTTTACGAC CGACGCCTGT CGGCAAACGA ACAGATCGCG
TGCGCTGCCT GTCACCGCCA GGAATTGGGG TTCAGCGATG GACGGGTTGT TTCGAACGGC
GCCACAGGCG CACTGCTGCG GCGGAATACG CCAGGGTTGT TCAATAGCGG CGAACTCCTG
GCGTTTACCT GGGCGAATGT TGAGGTGCGA ACGCTGGAAC AGCAGGTGGA GCGCGCGCTT
TTTACCGTTG ACCCGCCTGA GATGTGGGTG AGAGGGTATG AGACCACGGT GATCGACCGC
CTGCGCGCCG ATCCAGAATA TCTGCGTCAA TTCACTGCTG CGTTTCCCGC AGATGACGAC
CCGTTCACCT GGCGACGCAT CACCGGGGCG CTGGCAGCCT TTGTTCGCTC GCTGGCTGCG
CGCAACACGC CATACGACCG ATACGTCTAT GCTGGCGACC GTGCGGCATT GAGCGACAGT
GCGCAACGAG GCATGGCGCT CTTCTTTTCG CCAGGGCTGG CGTGCGGTCA TTGTCATGTT
GATGTTCCGT CGCCGGAGCG CGCCACGCCG CCACGCTGGT CCGATCTGGC ATATGTGGCG
ACGGGCGCCG GGTACAGCGC AGATCGCGGT CTAGCGGAGC AGACCGGCAA TCCGGCGGAT
GCCTACCGAT TTCGCGTGCC GCCGTTGCGG AATGTGGCGG TGACTGCACC CTATATGCAC
GACGGAAGCC TGCCTACCCT CGAGGCGGTC ATCCGATTTT ATGAGTCCGG CGGGCGATGG
GGCGCCGGCG TGGAACCGGA ACGCGTCGCC GCCCGTCACC CGCTGATCGC CGGTTTTGCG
CTGAGCGACG AGGAGCGTCG CGATCTGATA GCCTTTCTCG AAGCGCTGAC CGACGATGAA
GCGTTGCGGA ACCCGGCATT TGCCGACCCG TTCTTATCCG ATGCGCGAAC CCCGTCTGTT
CGCTCGCTCT CCCGGTAA
 
Protein sequence
MQRFVHFFSL ALLIAVFITV IRGDSPIART PAPGESPQAM IALGRRLFYD RRLSANEQIA 
CAACHRQELG FSDGRVVSNG ATGALLRRNT PGLFNSGELL AFTWANVEVR TLEQQVERAL
FTVDPPEMWV RGYETTVIDR LRADPEYLRQ FTAAFPADDD PFTWRRITGA LAAFVRSLAA
RNTPYDRYVY AGDRAALSDS AQRGMALFFS PGLACGHCHV DVPSPERATP PRWSDLAYVA
TGAGYSADRG LAEQTGNPAD AYRFRVPPLR NVAVTAPYMH DGSLPTLEAV IRFYESGGRW
GAGVEPERVA ARHPLIAGFA LSDEERRDLI AFLEALTDDE ALRNPAFADP FLSDARTPSV
RSLSR