Gene Phep_3987 details


Gene Information

Locus tag: Phep_3987
Symbol:
ID: 8255121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4810871
End bp: 4812181
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 42%
IMG OID: 644937651
Product: alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen
Protein accession: YP_003094240
Protein GI: 255533868
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0526] Thiol-disulfide isomerase and thioredoxins
TIGRFAM ID:
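
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4810871
END_BP = 4812181
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print("coordinates consistent:", coords_ok)
print("protein length consistent:", protein_ok)
```

Under these assumptions both checks pass for this record: the span covers 1311 positions, and 1311 bp corresponds to 437 codons, one of which is the stop codon.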


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTTA AACTTTTGAT CCTCCTGGCA GTAGGTATAC TGGCCGGAGG CAATGCTTTT 
ACACAAACTA CAGTGGTTAC GATCGGGACA AAAATTCCGG GGCATACCTT AACCAGGCTC
ATCAACTTCC CAAGGCCGAG CGTTGGATTG ACAGATTTTA AGGGTAAAAT CCTGATTCTT
GATTTCTGGA ACCGTGGCTG TACCTCTTGT ATTGACTCCT GGCCAAAACT GATGAAGTTG
CAGGAAAAAT TTAAGGATAA CATCCAGATT CTGCTGGTAA ATGACCTGGA TGATGAGGCT
ACGATACGGG CATTGATCAA AAGGTGGTCA AAGACCTTCG AGATGCAGAT GACACTACCT
ACTGCTTATC AGGATAAAGT GGTAAATTCC ATGTTTCCTC ATCAAAGTGT CCCACATTTG
GTATGGATTG ATCAGGTTGG GACAGTTAAA TATATTTCTA TGGCCCAGTT TCTGAACCAT
GAGACCATTG AAAATATGAT CCTGGGAAAA GAAATGGATA TCCGTCAAAA AAACGATGTT
CAGGTCCCCG TAAAATGGAG CAGGCCCTTG TTTATAAATG GCAATGGGGG GCCGGAAGAA
GATGTCTTAA GCCGTACGGT AATCAGGAAT AACATATTGA ACCAGATGGG GGTTTTCATG
GCAGGAAAAG TTAAAGACGC CAATGCATCC TATGCCGTAA TCAGTAATAC CACAGTGGTG
GATATGTTCA GGATGCTTTA TGGCAAGGGG GTCGACAGGA TTGGGAACCA GCTGCGTGTA
CCCTATAGCC AGGTTATATT AAAGGCAGCA GATACAACCA AACTGGTAAA CAAAGTAAAT
GATGCCGTGA GGCCTGAAAA CTTTTATACC ATACAGTTTA CTGCCGAAAA GGTTTTCTCA
ACGGAGAAAT TAAAAGACAT TCTGAAAAGC GACCTGCAAC GGTATTTTGA ATTAAAAGTT
AGCCGGGAAA AAGTAAAGAA AACCTGCCTG GTTGTTTCCA GATCGGAATT TCCTGTTACC
GCGTATAAAG AGGGTACACA GGTTTTGAAC ACCAATGATG GCATGTTGAA ACTAAATGCA
GTTACCTTGC AGGAGCTGTT AAATGCACTC ATTGGGAGAA TTGGCTATTA CAGCGGGCTT
TCTTATCCCA TTGTAAATGA ATCGGGATTT ACAGATAAAC TGGGAAATAT TGAAATTGAC
ACAAACATAC AGGACTGGAA AAAATTAAGT TATGCTTTAA ATAAACATGG TTTTATCTTT
TCATTACAGG AAAGGGAAAT TGAAGCCCTG GTGATCAGTG ACGATCAGTG A
 
Protein sequence
MRFKLLILLA VGILAGGNAF TQTTVVTIGT KIPGHTLTRL INFPRPSVGL TDFKGKILIL 
DFWNRGCTSC IDSWPKLMKL QEKFKDNIQI LLVNDLDDEA TIRALIKRWS KTFEMQMTLP
TAYQDKVVNS MFPHQSVPHL VWIDQVGTVK YISMAQFLNH ETIENMILGK EMDIRQKNDV
QVPVKWSRPL FINGNGGPEE DVLSRTVIRN NILNQMGVFM AGKVKDANAS YAVISNTTVV
DMFRMLYGKG VDRIGNQLRV PYSQVILKAA DTTKLVNKVN DAVRPENFYT IQFTAEKVFS
TEKLKDILKS DLQRYFELKV SREKVKKTCL VVSRSEFPVT AYKEGTQVLN TNDGMLKLNA
VTLQELLNAL IGRIGYYSGL SYPIVNESGF TDKLGNIEID TNIQDWKKLS YALNKHGFIF
SLQEREIEAL VISDDQ
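
As an illustration of how the two sequences relate, the sketch below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. It is a hedged example rather than the method used to produce this record: the helper names are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used for display in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_prefix = "ATGAGATTTA AACTTTTGAT CCTCCTGGCA GTAGGTATAC TGGCCGGAGG CAATGCTTTT"
protein_prefix = clean("MRFKLLILLA VGILAGGNAF")

print(translate(gene_prefix) == protein_prefix)
print(f"GC fraction of prefix: {gc_fraction(gene_prefix):.3f}")
```

Pasting the full gene sequence into `gene_prefix` extends the same check to the whole CDS. The GC fraction of a short prefix will not necessarily match the 42% reported for the full gene.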