Gene Phep_0331 details

Gene Information

Locus tag: Phep_0331
Symbol:
ID: 8251416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 384334
End bp: 387351
Gene Length: 3018 bp
Protein Length: 1005 aa
Translation table: 11
GC content: 45%
IMG OID: 644933979
Product: Heparinase II/III family protein
Protein accession: YP_003090617
Protein GI: 255530245
COG category:
COG ID:
TIGRFAM ID:
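
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(384334, 387351)
print(length)                     # 3018
print(protein_length_aa(length))  # 1005
```

Both results agree with the Gene Length and Protein Length fields of this record.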


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAA AATTTAGCCT GATCCATGTA TTCATGGTCC TGATGTTATT GGGACATACT 
AATGCCCATG CCGGCCGGAT CTATGGTGGC TTCTATACCG CTGAAAAAAT AGCCAATGTA
AGAAGCAATT GTAACAGATA TGATTGGGCT GCAAAACAAA GGGACCGGTT TATTGCCCAG
GCAAAATACT GGCTGGCCAA AGATGACGAA ACGCTATGGG CCATGGTTCC CGGACAGGAC
CTGCCGCGAT GCATAGATGT TACCTTTGAC AGGCTGACCG AGGGCCCTAA ATTTTTAGGT
TGTCTGAAAT GCGGACACAA CATTTCAAAA TATGGTAATT ATCCATACAA TCCTGAATTC
GAGCAAAAGC CCTGGAAATT GACCTGCCCT TCCTGCGGCA CTGTTTTTCC TACCAACGAT
TTTGGTAAAT ATTATAAAAG TGCCATTGAT GAACATGGTC TTTTTAATCC GGCAAGTGGC
GACAAGAGCC TGCTTTACAA CCTGGAACAT CCTGATCCAA AAGATCCGCT GCATAAATAT
GGTGTAGACG ATGGATTTGG TTATGTAAAT GAAAATGGGC GGTCGTACAG GTTCATCGGA
TATTATACCT GGAAGTACTG GGACCATATT AATGCAGGAC TCAACGCCCT GGCAAATGCA
TTTTTGTATA CCGGAGATCA GCGCTACGCA CATAAGGCGG CAATTTTGCT GGACCGCATA
GCGGATGTTT ATCCCGACAT GGATTGGGCG CCTTATTCAA AAAAAGGCTG GTACCATTCG
GATGGAGGGA CAAACAGAGG TAAAATAGAG GGAGCCATCT GGGAAACCGG TACAGTTCAG
GGCTTTGCAG ATGCTTATGA TAAAATTTTA AGTGGTACGG TTAATGATGC TTCACTTTAT
TCATTTTTAA AAAAGCAATC GTTGAAATAT AAACTGCCCG GGGCCAAAGG CACACGGGAC
CTGTTTGTAA AAAATGTTGA TGACGGAATC TTGCGTACAG CTTTTAAAGG CGTGCTTTCA
AAACAGATCT GGGGCAACCA GGGTATGCAC CAGTTAACTG TTGCCAGATG TGCAGTAGCT
TTAAATACGG CCCCTGAAAC TACGGAATGG CTCGATTGGT TGTTTACCCA TGATGGAGGA
AACATTCCGG GCCTGATGAT CCGTAACCTG GACAGAGATG GTACTACCGA TGAAGGTGCT
CCGGGGTATA CTTATATGTG GGGAAACCTG ATCGCCAAAC TCGGGGTGCT CCTGGCTGAC
TACAAGAGTT ATACCAAGCA CGATATTTTT GCCGAATATC CTCAGTTCAA CGCTACATTT
ATGGCGGCCT ACCGCATGTC TGTATTAGGG ATTTCGATAC CTAATATCGG CGACGCAGGC
GCTACAGGTA CGGTTACCAA CAGTTATATC GACCCCAATT TTATTGCCCT GGGGTATTTT
TATACCAGAG ATCCGCAAAT GGCAATTGCA GCCTACCGGG CAAATGGTAA TTCTGTAAAG
GGGCTGGGAA TGGACATCTA TTCCAAAGAT CCTGAATCAC TGAGCCGGGA GATCAAAAGC
GTTGCAGAAA AGAACCCCGC AAATGACGGA AGAGGGGGAT TGATGAGCGG ATTTGGCCTG
GCCTCACTGG AAATTGGTAA AGGCACATCG GGCATTGCAC TGGCCAGCAA TTTTGGACGG
AGCATTAAAC ATGCACATCC GGATATGCTC AATTTTGACC TGCTGGCTTA TGGCAACTGG
CTTGCGCCTG ATCATGGATA CCCCGAATAT GCAACAAAAT GGCCAAGTAA CAACGAATGG
ACAGGCAGTA CGCTTTCGCA CAACCTGGTA TTTGTAAACG GGCTTCCGCA AAAGGAAGTA
TGGGGCGGAC ATACCCGGAT GTTTAAACAG CTGAAAGGAT TTGGGGCTTT TGAACTTGAT
GGAAAAAAGG CCTATCCGGA TGTAAAGGAA TACAGTCGGA CAATGTTGTT GATCGAAGGA
CCCGATACCG GCAATGCCTA TGCAATAGAT ATATTCCGGG TATTGGGTGG TCATGATCAT
CTGTACAGTT TTCATGGCCC GCCAGGAACG ATCAGTACCG AAGGATTAAA GTTAAAGCCC
CAGCAGGGAG GAACTTATGC CGGAACTGAA GTAGCCAAAG GTACGCTTGC CAAAGGTTTT
CCAATTGGTT ATTCACATTT ATACAATGTT AAAAGAGACA CCATTCCTCC CCCACAATTT
ATGCTCGACT GGAAAGTGGA GGATGGCTAC AGAAATGTTA AAGCAACAGA CCATCTGCAT
TTGCGTATGT ACGCGTTGAA CCAGGTGGAT GATGTGGCCT TGGCCGATGG TGATCCACCA
CAGAATAAAA CGGGTAATCC TAAAACGCTC GGTTATGTTT TAATGCACCG CGCAGCACCA
GCGCTAAACA GTAATTTCGT TAACCTGATT GAACCTTATA AGCAAAACCC CTTTATCAAA
TCCGTAAAAC GTTTGGATGA GGGGAAAAAC ATGCAGGTAT CGCTAAAGAT AGAACATGTA
AACGGAGAGA TAGATTATAT CCTTTATAAT CCGGATTCAA CACAAACTAT GCAGACTGCA
GATGGATTGA AAATGGATGG GACCCTTGGC TATGTGAGAC AAAAGGGGGG TAAGCCTGTT
GAAGGGATCC TGCTGAATGG CAAACGGCTC AGCTATGCAA ACATGAACCT GATAGCTGCA
GGACCGATCA GGGGAAAAGT GGTGAAAATG AACAGGGAAT TAAAGGGGGG AGGATGGCTG
CTGGTTGATC AGCAGCTACC TGTTGATGGA AGTTTAAACG GATCTCAGCT CATGGTCAGC
ACAGAAGGGA AACGCGATGC CTGCTATTCA ATTGTCGGCA TTGAGCGTCA GGGCAATTTA
ACCAGGGTTT ACTGCGGCCC CATTACTTTT GTTAATGATT ATAAAGGTGA AAATTATAAA
GAAGGATTGA TGTATGACTT TGAAGAAGGC GCTGCATTTA CCATTACTTC TCATAAAATA
TGGAAACAAA AAATTTGA
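
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before computing properties such as length or GC content. The sketch below shows one way to do this, assuming GC content is the fraction of G and C bases over the full sequence; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used here as a short sample.
first_line = "ATGAAACTAA AATTTAGCCT GATCCATGTA TTCATGGTCC TGATGTTATT GGGACATACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Passing the complete gene sequence to the same functions gives the whole-gene values that can be compared with the Gene Length and GC content fields.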
 
Protein sequence
MKLKFSLIHV FMVLMLLGHT NAHAGRIYGG FYTAEKIANV RSNCNRYDWA AKQRDRFIAQ 
AKYWLAKDDE TLWAMVPGQD LPRCIDVTFD RLTEGPKFLG CLKCGHNISK YGNYPYNPEF
EQKPWKLTCP SCGTVFPTND FGKYYKSAID EHGLFNPASG DKSLLYNLEH PDPKDPLHKY
GVDDGFGYVN ENGRSYRFIG YYTWKYWDHI NAGLNALANA FLYTGDQRYA HKAAILLDRI
ADVYPDMDWA PYSKKGWYHS DGGTNRGKIE GAIWETGTVQ GFADAYDKIL SGTVNDASLY
SFLKKQSLKY KLPGAKGTRD LFVKNVDDGI LRTAFKGVLS KQIWGNQGMH QLTVARCAVA
LNTAPETTEW LDWLFTHDGG NIPGLMIRNL DRDGTTDEGA PGYTYMWGNL IAKLGVLLAD
YKSYTKHDIF AEYPQFNATF MAAYRMSVLG ISIPNIGDAG ATGTVTNSYI DPNFIALGYF
YTRDPQMAIA AYRANGNSVK GLGMDIYSKD PESLSREIKS VAEKNPANDG RGGLMSGFGL
ASLEIGKGTS GIALASNFGR SIKHAHPDML NFDLLAYGNW LAPDHGYPEY ATKWPSNNEW
TGSTLSHNLV FVNGLPQKEV WGGHTRMFKQ LKGFGAFELD GKKAYPDVKE YSRTMLLIEG
PDTGNAYAID IFRVLGGHDH LYSFHGPPGT ISTEGLKLKP QQGGTYAGTE VAKGTLAKGF
PIGYSHLYNV KRDTIPPPQF MLDWKVEDGY RNVKATDHLH LRMYALNQVD DVALADGDPP
QNKTGNPKTL GYVLMHRAAP ALNSNFVNLI EPYKQNPFIK SVKRLDEGKN MQVSLKIEHV
NGEIDYILYN PDSTQTMQTA DGLKMDGTLG YVRQKGGKPV EGILLNGKRL SYANMNLIAA
GPIRGKVVKM NRELKGGGWL LVDQQLPVDG SLNGSQLMVS TEGKRDACYS IVGIERQGNL
TRVYCGPITF VNDYKGENYK EGLMYDFEEG AAFTITSHKI WKQKI
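
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal, dependency-free illustration and not the pipeline used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons; because this gene starts with ATG, alternative start codons are not handled. All names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        # Unknown codons (for example, containing N) become "X".
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the last line of the gene sequence.
print(translate("ATGAAACTAA AATTTAGCCT GATCCATGTA"))  # MKLKFSLIHV
print(translate("TGGAAACAAA AAATTTGA"))               # WKQKI
```

The two outputs match the first ten and last five residues of the protein sequence listed above.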