Gene Phep_3992 details

Gene Information

Locus tag: Phep_3992
Symbol:
ID: 8255126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4819046
End bp: 4820176
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 43%
IMG OID: 644937656
Product: alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen
Protein accession: YP_003094245
Protein GI: 255533873
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1225] Peroxiredoxin
TIGRFAM ID:
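
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4819046
END_BP = 4820176


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1131 376
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.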


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.984971
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTA AAATAATTAT TGTTGCGCTC CTGCTACTGC CAGCAAGTTT TTGCCTGGCG 
CAGGATAAAT TTGAAATATC GGGCCAGCTT TCCCAAGCAG GTAAGGATAT GATGGTAATG
CTCAGCTACA AAAACAGCGA GGGTAAAGAC ACCAAAGACA GCGCCCTGGT AAAAAACGGT
AAATTCCTGA TCAGCGGTAC TACAGCTTTT GGTAATAAAG CGTATCTGAC ACTAATGCCT
GTAAAAAAGG ATACTATCCG ACGTGTCGGT CAATCTGATT ACCAGGAGTT TTACCTGGAA
AAGGGCATGT ATAAAGTAAC GGGTACAGAT AGCCTGGCCA AAGCAAGCAT AACGGGAGCG
CAGGCGCAAA AGGATTTTTT ATTATGGAAG TCTAAATCAC AGGCCCTGTT GGCCCAGTTC
AGAGAGATCA CCCAGCGGTT TACCAAGGTT TATTATGCGA AAGTGAAAGA TACGGTAACG
ATCAAAAAGA TTCAGGCCGA AGCAAGACCA GTACATGCCA AAATCGAAGC GGCATTGGAT
TCCTTTATTT TTAGTCACCC GGATTCCTAT GTGGCGCTTG ATCTGATTGC TTCAGAAAAG
ACCGCAGTAA TCGATCCCCA GGTTTTTGGG GCTTATTACA ATCCACTAAG CAAAAGAGTA
CTGGCCAGTT TTACTGGTCA GAAATTAACT GCCAAATTTG AAAAGGCAAA GAAAATATCC
ATTGGTAAAA CTGTAGACTT TACGCAAACA GACGATAAAG GCAATGAATT CAAGCTTTCT
TCATTAAAAG GAAAATACGT ACTGGTCGAT TTCTGGGCCA GCTGGTGCGT GCCTTGCCGT
GCAGAAAATC CACATTTGCT AAAAGCTTAT AACCAGTTAA AAGATAAGGG ATTTGAAATT
GTAGGGATTT CTCTGGATGA AACCAAAGCC GCCTGGCTTA ATGCTGTAAA GCATGATGGC
ATGCCCTGGA TACAGGTGAG TGACCTGAAG GGCTTTAAGA GTGAAATTGC AGTTCAATAT
GGTATCTCTG CCATTCCTCA AAACTTTCTG ATCGACCCAC AAGGCGTTAT CATAGCGAAG
AACCTAAGGG GTGAAGATGT AAATGAGCAG CTTGCGAAGC TGATCCGTTA G
 
Protein sequence
MNIKIIIVAL LLLPASFCLA QDKFEISGQL SQAGKDMMVM LSYKNSEGKD TKDSALVKNG 
KFLISGTTAF GNKAYLTLMP VKKDTIRRVG QSDYQEFYLE KGMYKVTGTD SLAKASITGA
QAQKDFLLWK SKSQALLAQF REITQRFTKV YYAKVKDTVT IKKIQAEARP VHAKIEAALD
SFIFSHPDSY VALDLIASEK TAVIDPQVFG AYYNPLSKRV LASFTGQKLT AKFEKAKKIS
IGKTVDFTQT DDKGNEFKLS SLKGKYVLVD FWASWCVPCR AENPHLLKAY NQLKDKGFEI
VGISLDETKA AWLNAVKHDG MPWIQVSDLK GFKSEIAVQY GISAIPQNFL IDPQGVIIAK
NLRGEDVNEQ LAKLIR
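
As a consistency check, the gene sequence can be translated and compared with the protein sequence. The minimal sketch below builds the codon table by hand rather than using a bioinformatics library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The `translate` helper is illustrative, and only the first 30 and last 51 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 51 bases of the gene sequence above.
first_bases = "ATGAATATTA AAATAATTAT TGTTGCGCTC"
last_bases = "AACCTAAGGG GTGAAGATGT AAATGAGCAG CTTGCGAAGC TGATCCGTTA G"

print(translate(first_bases))  # prints: MNIKIIIVAL
print(translate(last_bases))   # prints: NLRGEDVNEQLAKLIR*
```

Both fragments match the corresponding ends of the protein sequence, with the trailing `*` marking the TAG stop codon that is not part of the 376-residue protein.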