Gene Phep_1078 details

Gene Information

Locus tag: Phep_1078
Symbol:
ID: 8252172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start (bp): 1268649
End (bp): 1269800
Gene length: 1152 bp
Protein length: 383 aa
Translation table: 11
GC content: 41%
IMG OID: 644934729
Product: alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen
Protein accession: YP_003091358
Protein GI: 255530986
COG category:
COG ID:
TIGRFAM ID:
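
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon (the record states neither); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1268649
end_bp = 1269800
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # span matches the gene length
print(span_bp % 3 == 0)                           # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # codons minus stop match the protein length
```

Under these assumptions all three checks agree with the reported values.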


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.899255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00036256
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCAATC GTAACGTAGT CTTCGCATTT TTTTTCCTTG GCAGCCTTTT GTGTTCTTTT 
TTCTCTTACG GACAACAAAA CGGACTGAAT ATAAATCTTA AGCTAAAGGG CTTAAGTTCG
TTAAATTATG CTGTATTGTA CAAATATAAT GGAAACATAG CGGATTCCAT CGCAAAACAA
AAAATGATAA ATGGTACTAC AGTAATCCAT GCATCTGTAC AGATTGAACC GGGATTTTAT
TTTCTTAAAG TAGGTAATTT AAAGGAGACT TTGCCTTTGT TTATAGGAGC AGGAAAAGTC
GATGTAACGG GTGAGGCGTC CTTGTGGCCA AAAGTACAGG TAACTGGCGC AGTGGACCAC
AAGGATTACC AGGATTATCA TTTATTGACA GATTCTTTGG AAAGTGCTGG GCAGGCATTG
TTTAAGGTCT GTACAGCTCT TCAGGATTCA AAAGATACAA CCGCCCAGTC TGCTTTGCAG
AATAAGATAA TGAGTAACAT CACGTCGTAT GAGAAAAAAC AGTTGGACTT TGTAAGCAGC
CACCTCGATT CTTACTACAG CCCAGTAGTG ATTAATAATG CTAAATGGGA TTGGACAAGG
AAGAGAGCTG CTTACGATAG GTTGACCCCA TCAATTAAAG CCAGCAAATA CGGGGTTTTC
CTTGCTGAAA ACATTGCCAA ATGGAAAAAG GCTTCTGGCT TACTGGAGAT AGGTGATACT
GTTCCTGAGT TCATTGCCAA AACTGCCAAC AATGCTGATC TCTCTTTACA AAAAGAAATA
GCTGATAACA AGCTTACTTT AATTGACTTC TGGGCCAGCT GGTGTGTGCC TTGCCGGCAG
GAAAACCCTA ACCTCGTCAA GACCTTTCAG GAAAACAAAT CTAAAGGTTT TGACATCATA
GGGATATCAC TGGATGAAAA GTCAGCAGAA TGGAAAGCGG CCATTTCGAA AGATGGGCTG
GTATGGCGGC AGGTTTCAGA TTTAAAGGGC TGGGCATCGC CTATTGCCAA GCTTTATTTT
GCGGGAATGC CTTTTAATTA TATCCCCCAG AATTTTCTGG TAGATGGGAA AGGAAAGATT
CTTGCCAGGA ACCTACGTGG CGATCAGTTG AGCAAAAAGG TTGATGAACT ATTAAAAGGG
AACAGCTTAT GA
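
The record reports a GC content of 41%. As a rough sketch of how such a figure could be computed from the gene sequence above, the hypothetical helper below strips the display spacing and counts G and C bases. The rounding used by the source database is not stated, so no particular output is claimed here.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to display the sequence in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as a short demonstration input.
first_line = "ATGATCAATC GTAACGTAGT CTTCGCATTT TTTTTCCTTG GCAGCCTTTT GTGTTCTTTT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of `first_line` would give the value to compare against the reported figure.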
 
Protein sequence
MINRNVVFAF FFLGSLLCSF FSYGQQNGLN INLKLKGLSS LNYAVLYKYN GNIADSIAKQ 
KMINGTTVIH ASVQIEPGFY FLKVGNLKET LPLFIGAGKV DVTGEASLWP KVQVTGAVDH
KDYQDYHLLT DSLESAGQAL FKVCTALQDS KDTTAQSALQ NKIMSNITSY EKKQLDFVSS
HLDSYYSPVV INNAKWDWTR KRAAYDRLTP SIKASKYGVF LAENIAKWKK ASGLLEIGDT
VPEFIAKTAN NADLSLQKEI ADNKLTLIDF WASWCVPCRQ ENPNLVKTFQ ENKSKGFDII
GISLDEKSAE WKAAISKDGL VWRQVSDLKG WASPIAKLYF AGMPFNYIPQ NFLVDGKGKI
LARNLRGDQL SKKVDELLKG NSL
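
The protein sequence follows from the gene sequence by translation with table 11. A minimal sketch of that step is given below, assuming the displayed gene sequence is the coding strand read in frame from its first base. The `translate` function and the table constants are illustrative names, not part of any database tool. The sketch does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. NCBI translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the display spacing before reading codons.
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 12 bases of the gene sequence.
print(translate("ATGATCAATC GTAACGTAGT CTTCGCATTT"))  # MINRNVVFAF
print(translate("AACAGCTTAT GA"))                     # NSL
```

The two fragments reproduce the first ten and last three residues of the protein sequence, and the final codon TGA is the stop.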