Gene Dret_1885 details


Gene Information

Locus tag: Dret_1885
Symbol:
ID: 8419728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2166314
End bp: 2167564
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 61%
IMG OID: 645038471
Product: Di-heme cytochrome c peroxidase
Protein accession: YP_003198747
Protein GI: 258406005
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
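
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2166314, 2167564)
print(length)                  # 1251
print(protein_length(length))  # 416
```

Both values agree with the Gene Length and Protein Length fields of this record.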


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0712604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid clones covering gene (Num covering fosmid clones): 12
Fosmid unclonability p-value: 0.00642624
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCTTCA ATGTCGTACT GACCACCACC ATCTGCCTCG CCCTGCTTCT CCCCTCGCTT 
GGTTCGGGCG CTGAGGACAT CCAATGGACG GACCAGGAGA AGCGTCTTAT CGCCAGCATG
CAATTGGACA AGCTGCCGCC CCTGCCGGAC GATCCGTCGA ATGATGTGGA CACGGACCCG
GCTGCGGCGC GCTTTGGGGA AAAGGTCTTT CACGATGCCC GGTTCAGTGC CAACGACAAG
ATTTCCTGCG CCACCTGCCA CCCGGAAGAC AAATCCTTCC AGGACGGGCG TCCAGTGGCC
GTCGGCGTGG GCCGCGTGAC GCGGCGGACC ATGCCGCTTA TCGCTGTGGC CTATAACGAC
TGGTTTTTCT GGGACGGACG CAAGGACACC CTCTGGAGCC AGACGCTGGC CCCGATCGAA
AACCCCCGGG AACACGGTAT CAGCCGGACC GGTTGTGTGG AGTTGATCCG GTCCCAATAC
CGCGACGAGT ACGAGGCGGT TTTCGGGCCG CTCCCCAAGC TGCCCGCCAA CCTCCCGTCC
ATCGCCATGC CCGTGGAATT CGATCAGGAC GCCCTCGAGG CGTGGCGGGA GCTTGATCCG
GCGGTGCGGG ACACAATCAA CACCATTTTC GCCAACACCG GCAAGGCACT GGCCGCCTAT
GTCCGCGTTA TCCTGCCCAG CGAAGCGCCG TTTGACCGCT TCGCCGCCGA CTTGGCAGCC
GGCAACGAGA ATCAGGCCGA TGGCCATCTG TCACAAAAAC AGCAAGCCGG GCTGAAGCTC
TTTATCGGCG ATGCGGGCTG CTTCGGCTGC CACTTCGGTC CCCGGCTGAC CAACGACGGC
TTCCACGACA CCGGCGTCAA CGAGGCCTTC GGTCCGGAAT TCGATGCCGG ACGGGCCAAA
GGCATTACCC AGGTCCAGCA CGACATGTTC AACTGCATGG GGGACTATTC CGACGCCGAG
CCCAAGGAGT GTTCGGCGCT GCGATTCATG GACACGGATC AGGACAAATA CCGCCAGGCC
TTCAAGACCC CGACACTGCG CAACGTGGCT GTCCGCCCGC CGTACATGCA CGCCGGGCAG
ATCGAGACCC TGGAAGAAGT CATCGACTTC TACGCCAAGG AAAGCGAGAC CAATCCTGAA
CTGACCCACG CCGATCTGAC AGAAGAGAAA AAAGACGCGC TGATCGCCTT TATGGAGTCG
CTGACCAGCG ATGTCACGCC GAGGATGGAC GAAGTTTTGA ACTCTCAATA A
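
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be compared with the fields in this record. The following is a minimal sketch of that cleanup. The helper names are hypothetical, and only the first line of the sequence is used as sample input; paste the full block to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCGCTTCA ATGTCGTACT GACCACCACC ATCTGCCTCG CCCTGCTTCT CCCCTCGCTT"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.1%}")     # GC fraction of this line only
```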
 
Protein sequence
MRFNVVLTTT ICLALLLPSL GSGAEDIQWT DQEKRLIASM QLDKLPPLPD DPSNDVDTDP 
AAARFGEKVF HDARFSANDK ISCATCHPED KSFQDGRPVA VGVGRVTRRT MPLIAVAYND
WFFWDGRKDT LWSQTLAPIE NPREHGISRT GCVELIRSQY RDEYEAVFGP LPKLPANLPS
IAMPVEFDQD ALEAWRELDP AVRDTINTIF ANTGKALAAY VRVILPSEAP FDRFAADLAA
GNENQADGHL SQKQQAGLKL FIGDAGCFGC HFGPRLTNDG FHDTGVNEAF GPEFDAGRAK
GITQVQHDMF NCMGDYSDAE PKECSALRFM DTDQDKYRQA FKTPTLRNVA VRPPYMHAGQ
IETLEEVIDF YAKESETNPE LTHADLTEEK KDALIAFMES LTSDVTPRMD EVLNSQ
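
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hand-written illustration, not the tool that generated this page. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The `translate` function and its behaviour of stopping at the first stop codon are assumptions made for this example.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCTTCA ATGTCGTACT GACCACCACC"))  # MRFNVVLTTT
```

The output matches the first block of the protein sequence listed above.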