Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1885 |
Symbol | |
ID | 8419728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2166314 |
End bp | 2167564 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645038471 |
Product | Di-heme cytochrome c peroxidase |
Protein accession | YP_003198747 |
Protein GI | 258406005 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0712604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00642624 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCTTCA ATGTCGTACT GACCACCACC ATCTGCCTCG CCCTGCTTCT CCCCTCGCTT GGTTCGGGCG CTGAGGACAT CCAATGGACG GACCAGGAGA AGCGTCTTAT CGCCAGCATG CAATTGGACA AGCTGCCGCC CCTGCCGGAC GATCCGTCGA ATGATGTGGA CACGGACCCG GCTGCGGCGC GCTTTGGGGA AAAGGTCTTT CACGATGCCC GGTTCAGTGC CAACGACAAG ATTTCCTGCG CCACCTGCCA CCCGGAAGAC AAATCCTTCC AGGACGGGCG TCCAGTGGCC GTCGGCGTGG GCCGCGTGAC GCGGCGGACC ATGCCGCTTA TCGCTGTGGC CTATAACGAC TGGTTTTTCT GGGACGGACG CAAGGACACC CTCTGGAGCC AGACGCTGGC CCCGATCGAA AACCCCCGGG AACACGGTAT CAGCCGGACC GGTTGTGTGG AGTTGATCCG GTCCCAATAC CGCGACGAGT ACGAGGCGGT TTTCGGGCCG CTCCCCAAGC TGCCCGCCAA CCTCCCGTCC ATCGCCATGC CCGTGGAATT CGATCAGGAC GCCCTCGAGG CGTGGCGGGA GCTTGATCCG GCGGTGCGGG ACACAATCAA CACCATTTTC GCCAACACCG GCAAGGCACT GGCCGCCTAT GTCCGCGTTA TCCTGCCCAG CGAAGCGCCG TTTGACCGCT TCGCCGCCGA CTTGGCAGCC GGCAACGAGA ATCAGGCCGA TGGCCATCTG TCACAAAAAC AGCAAGCCGG GCTGAAGCTC TTTATCGGCG ATGCGGGCTG CTTCGGCTGC CACTTCGGTC CCCGGCTGAC CAACGACGGC TTCCACGACA CCGGCGTCAA CGAGGCCTTC GGTCCGGAAT TCGATGCCGG ACGGGCCAAA GGCATTACCC AGGTCCAGCA CGACATGTTC AACTGCATGG GGGACTATTC CGACGCCGAG CCCAAGGAGT GTTCGGCGCT GCGATTCATG GACACGGATC AGGACAAATA CCGCCAGGCC TTCAAGACCC CGACACTGCG CAACGTGGCT GTCCGCCCGC CGTACATGCA CGCCGGGCAG ATCGAGACCC TGGAAGAAGT CATCGACTTC TACGCCAAGG AAAGCGAGAC CAATCCTGAA CTGACCCACG CCGATCTGAC AGAAGAGAAA AAAGACGCGC TGATCGCCTT TATGGAGTCG CTGACCAGCG ATGTCACGCC GAGGATGGAC GAAGTTTTGA ACTCTCAATA A
|
Protein sequence | MRFNVVLTTT ICLALLLPSL GSGAEDIQWT DQEKRLIASM QLDKLPPLPD DPSNDVDTDP AAARFGEKVF HDARFSANDK ISCATCHPED KSFQDGRPVA VGVGRVTRRT MPLIAVAYND WFFWDGRKDT LWSQTLAPIE NPREHGISRT GCVELIRSQY RDEYEAVFGP LPKLPANLPS IAMPVEFDQD ALEAWRELDP AVRDTINTIF ANTGKALAAY VRVILPSEAP FDRFAADLAA GNENQADGHL SQKQQAGLKL FIGDAGCFGC HFGPRLTNDG FHDTGVNEAF GPEFDAGRAK GITQVQHDMF NCMGDYSDAE PKECSALRFM DTDQDKYRQA FKTPTLRNVA VRPPYMHAGQ IETLEEVIDF YAKESETNPE LTHADLTEEK KDALIAFMES LTSDVTPRMD EVLNSQ
|
| |