Gene Phep_3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3194 
Symbol 
ID8254313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3800819 
End bp3801952 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content46% 
IMG OID644936847 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_003093451 
Protein GI255533079 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.274558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.381413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAG TATTATTTAT AGACAGGGAC GGCACCCTGA TCACCGAACC GGAAGATGAA 
CAGATCGACT CTTTTGCCAA ACTGGAGTTT TATCCCAGAT CGCTCTATTA TTTATCGAAG
ATAGCAAAAG AGCTGGATTA CGACCTGGTG ATGGTAACCA ACCAGGATGG ATTGGGCACA
GCTTCGCATC CTGAAGAAAA CTTCTGGCCG GTGCACAACT TCATTATGAA AACCTTTGAA
AATGAAGGTG TCATTTTTAA CGAGGTGATC ATCGACAAAA CCTTTGCCCA TGAAAATGCG
CCTACACGCA AGCCGAATAC AGGTTTACTG CAACATTTTT TAGCAGAGGG CTATGATCTG
GCGAATTCAT TTGTGATAGG CGACCGCATC AATGATGTGA TTCTGGCTAA AAACCTGGGT
GCAAAAGCCA TCTGGCTGCG CAACAACGAC CTGCTTGGTG CCAATGAAGT ATTGGAAAAA
GCGGACACGC TGGAGAACAT CATCGCTCTG CAAACACAGG ACTGGGTAGA TATTTACACC
TTTTTAAGGG CAGGCAGCCG GACCGTACAT CATGAGCGCA ATACCAATGA AACGAAGATC
AGCATTGACA TTGACCTGGA TGGTACCGGA AAGGCAGTGA TCCGTACCGG ACTGAATTTT
TTTGACCACA TGCTGGACCA GATTGCCCGT CACGGCAGCA TTGACCTACG CATTAAAACC
GATGGCGACC TGCATATTGA CGAGCACCAT ACCATTGAGG ATACGGGCAT TGCCTTAGGG
GAGGCTTTCG CAAAAGCCAT TGGCAACAAA CTTGGGCTGG AACGCTATGG CTTTTGTTTG
CCCATGGATG ATTGCCTGGC ACAGGCAGCC ATTGATTTTG GTGGCCGTGC CTGGATAGTA
TGGGATGCAG AATTTAAGCG GGAAAAGGTA GGCGATATGC CTACAGAAAT GTTTTATCAT
TTCTTTAAAT CCTTTAGCGA TGCGGCCAGA TGCAACCTGA ACATCAAAGC AGAAGGAGAT
AATGAACATC ATAAAATTGA AGCTATATTT AAAGCCTTTG CCAAGGCGAT TAAAGTCGCA
ATTAAACGCG ACCCGGATAA ATTAGTCTTA CCAAGTACCA AAGGGTTATT GTAA
 
Protein sequence
MKKVLFIDRD GTLITEPEDE QIDSFAKLEF YPRSLYYLSK IAKELDYDLV MVTNQDGLGT 
ASHPEENFWP VHNFIMKTFE NEGVIFNEVI IDKTFAHENA PTRKPNTGLL QHFLAEGYDL
ANSFVIGDRI NDVILAKNLG AKAIWLRNND LLGANEVLEK ADTLENIIAL QTQDWVDIYT
FLRAGSRTVH HERNTNETKI SIDIDLDGTG KAVIRTGLNF FDHMLDQIAR HGSIDLRIKT
DGDLHIDEHH TIEDTGIALG EAFAKAIGNK LGLERYGFCL PMDDCLAQAA IDFGGRAWIV
WDAEFKREKV GDMPTEMFYH FFKSFSDAAR CNLNIKAEGD NEHHKIEAIF KAFAKAIKVA
IKRDPDKLVL PSTKGLL