Gene Phep_4097 details


Gene Information

Locus tag: Phep_4097
Symbol:
ID: 8255231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4939135
End bp: 4940457
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 44%
IMG OID: 644937761
Product: glycosyl hydrolase family 88
Protein accession: YP_003094350
Protein GI: 255533978
COG category:
COG ID:
TIGRFAM ID:
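
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 4939135
end_bp = 4940457

# With 1-based inclusive coordinates, the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Each codon is three bases; the final codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1323 440
```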


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAGG CGGGGGATCT TTTTAAACAC AGAACAAACG GAAAAGGAGC AGGAAAGGTT 
GCGTGCTCCA TTACAATTGG AATAGAAATT AACCAGATAG TAATGAATAA ATTAAACACC
TACTTATGCG TATTGATTGG CGTCTCGATT GTAGCAGGAT GTGCAGTAAA GGAGCCGGTT
TTTAAAACAG ACCCCCTATT GTTAAAAAAC ATCGACAGCA CCTTTGCCGA TGCAGGTAAA
CAATACCACC TGATGATGAA AAACCTTCCC CCAAACCAAT TCCCGAAAAC TTTTTATCCC
CTTACAGGCA AATTCGAAGC CTGCAATTCC GACTGGTGGG TAAGCGGGTT TTATCCAGGT
TCTTTGTTGT ATATCTATGA ACAAACGAAA GATACAGCAC TTTATAACGA AACCCGCAGG
ATCCTGAAAG TGCTGGAAAA AGAGAAAAAC AATACCAGCA CTCACGACCT TGGCTTTATG
ATGTACTGCA GTTTCGGAAT GGCCGATAGG ATCAAACCAG AACCTGAGTA TAAAGACATT
TTAATCACCA GCGCCAAATC GCTGGCCAGC AGGTTTAACC CTAAAGTGGG TTGCATCCAG
TCATGGGATG CAAAACCTGA CGAGTTTCTG GTGATCATAG ATAACATGAT GAACCTGGAA
CTCCTGTTCT GGGCAACAAA GGAAACCGGC GACTCCAGTT ATTATAAAAT AGCCGTTACA
CATGCCAGTA CCACCATGAA AAATCATTTC CGGCCCGATT ACAGTTCCTA TCATGTCATC
GATTACAACC CGGAAACAGG CCTGGTACAA AAGAAAAGAA CAGATCAGGG CTATGCCGAT
GAATCGGCCT GGGCAAGAGG GCAGGCATGG GGCTTATATG GCTATACGCT GATGTACCGT
GAAACCAAAG ACAAAAAGTA CCTGGAGCAG GCCAACCATA TCGCTGAATT TATATTAAAG
CATCCCAACC TTCCCCAGGA TAAAATACCT TACTGGGATT TTAATGCTCC CGACATTCCC
AACGCGCTAA GAGATGCCTC GGCAGGGGCC GTTATGGCAT CAGCACTGCT GGAACTTTGC
CGGTATAATA AAGGAGAGCA GGGACAGCAT TATTTTAAAA CTGCCGAAAA AATGATAAAG
ACACTTTCTT CTGCACAATA TAAGGCTGCT ACAGGTACAA ACGGCGGGTT TATCCTTAAG
CATGGTGTAG GTCATATGCC CAAAGGCTCT GAAGTGGATG TGCCCCTTAC TTATGGCGAC
TATTATTACC TCGAAGCACT AAAGCGGTAC AAAGAAATGA AAACCCAAAC CAACCTATTT
TAA
 
Protein sequence
MIQAGDLFKH RTNGKGAGKV ACSITIGIEI NQIVMNKLNT YLCVLIGVSI VAGCAVKEPV 
FKTDPLLLKN IDSTFADAGK QYHLMMKNLP PNQFPKTFYP LTGKFEACNS DWWVSGFYPG
SLLYIYEQTK DTALYNETRR ILKVLEKEKN NTSTHDLGFM MYCSFGMADR IKPEPEYKDI
LITSAKSLAS RFNPKVGCIQ SWDAKPDEFL VIIDNMMNLE LLFWATKETG DSSYYKIAVT
HASTTMKNHF RPDYSSYHVI DYNPETGLVQ KKRTDQGYAD ESAWARGQAW GLYGYTLMYR
ETKDKKYLEQ ANHIAEFILK HPNLPQDKIP YWDFNAPDIP NALRDASAGA VMASALLELC
RYNKGEQGQH YFKTAEKMIK TLSSAQYKAA TGTNGGFILK HGVGHMPKGS EVDVPLTYGD
YYYLEALKRY KEMKTQTNLF
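
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. The following is a minimal sketch in Python, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons). The function names `translate` and `gc_fraction` are hypothetical helpers introduced here, and the example input is only the first 30 bases of the gene sequence above.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (NCBI layout).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(dna)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied with the original spacing.
prefix = "ATGATCCAGG CGGGGGATCT TTTTAAACAC"
print(translate(prefix))  # prints: MIQAGDLFKH
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the whole protein sequence and the listed GC content to be compared in the same way.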