Gene Phep_4225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4225 
Symbol 
ID8255361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5102153 
End bp5103337 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content46% 
IMG OID644937891 
ProductPUA domain containing protein 
Protein accessionYP_003094478 
Protein GI255534106 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.731484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATG TAATCCTAAA AAAAGGCAAA GAAAAAGCCG CTATGCTCCG TCACCCCTGG 
ATCTTTTCTG GTGCAATTGA TAAAATAAAG GGAAATCCTG AAAATGGAGA AGTGGTAACC
CTAAGGTCAG CAGCAAAAGA ATTTCTGGCC TATGCCTATT ACAACGATCA ATCCCGGGTA
GCCCTTCGTT TGCTCGAATG GGATGAAAAT ACCACGATAA ACAAGTCCTG GTACCGGGAA
AAACTCAAAG CCGCCATCGC CTCAAGGGAC CATGTATTAA ACGAAAATAC CAATACCTGC
AGACTGGTAT TCAGTGAAGC AGATTTTTTA CCTGGCCTGA TCGTAGATAA ATATGCCGGC
TTTCTTTCCC TGCAAATATT AAGTGCAGGC ATGGAAAGCA TAAAAGCAAC ACTTATCGAG
CTCCTCAGGG AATTGCTAAA CCCAACAGGG ATATTCGATA AAAGTGATGC CGGCGCACGT
AAACATGAAA ACCTCGAAGC TACACAAGGT CTTTTATGGG GAGAAACCCC TCCTGAATTT
ATTGAAGTAA AAGAAAACGG CATTCTTTAC CACATCAACA TTGCCGATGG CCAGAAGTCA
GGCTTCTATT GCGACCAGCG CGACAACCGG AAAATCCTGG CTGATTACAC CAGGGGCAAA
TCCGTTCTGG ATTGTTTCTG CTACAGCGGA GGATTTACCT TAAACAGCCT GAAATCCGGT
GCCGCATCCG TTACCAGCGT AGACAGTTCT GCACTGGCCA TTGAAACCCT GCAGCACAAT
CTGGAACTGA ACGGTTTTAA AGGATCCAAT CAGCATAGCA TACAATCTGA TGTCAACAAG
CAACTGCGCA TTTTTAAAGA CGAAGGAAAA ACCTTTGATG TATTGGTGTT AGACCCGCCA
AAATACGCGC CTTCAAGATC CGCACTGGAC CGGGCAGCAA GGGCTTACAA AGACCTGAAC
AGGCTTGGCA TGCTCCTGCT TAAAAAAGGG GGCATCCTGG CTACATTTTC CTGCTCCGGA
GCAGTAGACA TGGAAACCTT TAAACAGATC ATTGCATGGG CCGCATTGGA TGCAGGCCGC
GAAGTACAGA TCATCAGACA GTTCTGTCAG CCTGAAGACC ACCCGGTAAG TCTTTCTTTC
CCTGAAGGAG AATATCTAAA AGGATTATTG CTAAGGATAC TTTAA
 
Protein sequence
MIDVILKKGK EKAAMLRHPW IFSGAIDKIK GNPENGEVVT LRSAAKEFLA YAYYNDQSRV 
ALRLLEWDEN TTINKSWYRE KLKAAIASRD HVLNENTNTC RLVFSEADFL PGLIVDKYAG
FLSLQILSAG MESIKATLIE LLRELLNPTG IFDKSDAGAR KHENLEATQG LLWGETPPEF
IEVKENGILY HINIADGQKS GFYCDQRDNR KILADYTRGK SVLDCFCYSG GFTLNSLKSG
AASVTSVDSS ALAIETLQHN LELNGFKGSN QHSIQSDVNK QLRIFKDEGK TFDVLVLDPP
KYAPSRSALD RAARAYKDLN RLGMLLLKKG GILATFSCSG AVDMETFKQI IAWAALDAGR
EVQIIRQFCQ PEDHPVSLSF PEGEYLKGLL LRIL