Gene Phep_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1076 
Symbol 
ID8252170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1266956 
End bp1268218 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content34% 
IMG OID644934727 
Productalkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen 
Protein accessionYP_003091356 
Protein GI255530984 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0526] Thiol-disulfide isomerase and thioredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.626941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00953303 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATAC AATTAAAATT TCTTGCTGCC ATAATGGTTG TAAGCATGCC GTGCTTTGCA 
CAAAATTTAG CCTTGAGGGT AGGGCAACAA CTGCCAGATC TGGAAATCAA TAAAATCTTA
TATCATATAT CTGGGACGGC TAAATTGTCT GATTTTAAAG GCAAGCTTGT TATCCTAGAT
TTTTGGTCAA CGGCATGTGG CCCCTGCATA GAAGGAATGC CAAAAATGGA TTCATTGCAG
AAAGAGTTTA GCGATAAAGT AGTTATTCTA CCCGTCTATG CATTTGGGAA GACCTTTACT
GTTGATATTA TTGAAAGAAT AGATTCATTT TGGAAAACAA ATAAGTATAC GTCAAGTACA
AATTTACCTT CTGTTTTGGA CTCGGCTTTT GGTAGTTTTT TTCCTGTAAG AGTAGGATAT
CAGGTATGGA TTGATAGCAA TCGAATTGTT CGTGCTGTAA CAGGACCCGA ATATGTAAAC
AAAAAAGAAA TCCATAAAGT AATAAATGGT TTATATCCAC AATGGGAAAG TGAAATAAAG
GAAAACTCAT TGAATAAAAC GGTAAGTGAG TATTTATCCA GAGCAGAATT TTCAGGATTG
AAGGATCAGA ACTATTCTTT TTTTACAAAC TACTTAAAGC GAGTAGACAC TAATAGCGAT
ATCGTTAAAT CAGATTCGCA AGTTCTTGTG AGTTATAAAA ACTGTCCAAT AATAAGTTTT
TATTGGAAAT ACATACGTGC TAATAACTTG TTCCCTACGA ATGTTGTAAT AGAAACTAAA
GACTCTGTCC GTTATTTCAA AAAAGGTTAT ACAAACGAAT GGTTGAGAGA TCATAGTTAT
TGCTATGAAT TGAAGTTAAA CCACAAAGTG GCTGATAAGG ATGTAGCTGA TTACATTAGG
CAGGATGCCA ATAAGTATTT TGGCTTGAAT GGTAGATTGG AAACTAAGCG AGTAAGTTGT
TTGATACTTA AAAAAGTAAA AAAAAGTGCT GGAATGCCTA ATCCAGGCAG ATTGTCGAAA
ACTCCCATCT CACAATTTTA CGAGGAACTT AAGAGTAAGC AAAATTCATT ATGGCAGACG
GGGAAAAATG CTTTGCCTAT ACTGAATGAA ATAGATCAAA AATCCATTTC AGAAATAAGT
TACCCACAGG CACTTGATTT AACAAATATC CTGGAATTAC GTAAAGTATT ACGGAGCCAG
GGTTATGATA TAGTTGAAGA AAAGCGCACA TTAAAAACCT TTGTGATATC AGAAATTAAC
TAA
 
Protein sequence
MKIQLKFLAA IMVVSMPCFA QNLALRVGQQ LPDLEINKIL YHISGTAKLS DFKGKLVILD 
FWSTACGPCI EGMPKMDSLQ KEFSDKVVIL PVYAFGKTFT VDIIERIDSF WKTNKYTSST
NLPSVLDSAF GSFFPVRVGY QVWIDSNRIV RAVTGPEYVN KKEIHKVING LYPQWESEIK
ENSLNKTVSE YLSRAEFSGL KDQNYSFFTN YLKRVDTNSD IVKSDSQVLV SYKNCPIISF
YWKYIRANNL FPTNVVIETK DSVRYFKKGY TNEWLRDHSY CYELKLNHKV ADKDVADYIR
QDANKYFGLN GRLETKRVSC LILKKVKKSA GMPNPGRLSK TPISQFYEEL KSKQNSLWQT
GKNALPILNE IDQKSISEIS YPQALDLTNI LELRKVLRSQ GYDIVEEKRT LKTFVISEIN