Gene Phep_3864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3864 
Symbol 
ID8254998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4636577 
End bp4637779 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content45% 
IMG OID644937528 
Productglycosyl hydrolase family 88 
Protein accessionYP_003094117 
Protein GI255533745 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTATA AATTAAAAAC AAGCTGTTTT GTATTGCTGT TCATGGCTTT TGCCGCGGTG 
TCGGTAAATG CCCAGAAACG CTTTAAGCCG AATGGAAATC TATTAAAAGC GGTAAAAAGG
GGTCTGAACG AATCAGCTCT GCAATATCAT TTTTTGATGG AACAGCTACC TGCCGGACGT
TTCCCCGTTA CCTATTACAG TAAAGAACAA AAAATGATTA CCAGCGGCTC CGAGCCCTGG
GTAGGTGGTT TTTATCCGGG GGGGTTACTT TACCTGTATG AAAGCACTAG AGATACAGCG
CTATATAATG AAGCTTTACG CAAACTAAAA TTGCTGGAAA AAGAGCAGTT TAACAAAACT
ACGCACGACC TTGGCTTTAT GATGTATTGT TCTTTTGGAA ATGCCCGAAG GCTGATGCAC
ACAAGCGCAT ACGATCAGAT CATCATCAAC AGTGCAAAAT CACTTTCCAG CCGTTATAAT
GATAAAGTGG GCTGTATCCG TTCATGGGAC TCTGATGCTG CACGTTTCAT GGTCATTATA
GACAATATGG TCAATCTGGA ACTGCTGTTT GCTGCAACAA AATTAACCGG AGATTCCAGC
TATTACCACA TCGCAGTGAA ACATGCGAAT ACCACCATGA AGCACCATTA CCGTGCGGAT
TACAGTTCCT ACCATTTGGT CATCTATAAT CCTGAAACCG GTGCTGTTTC CAAAAAACAA
ACAGTTCAGG GGGCAGCTGA TACTTCAGCA TGGGCGAGGG GGCAGGCCTG GGGATTATAT
GGTTATACTG TAATGTACCG GGAAACAAAG GATAAAAAAT ATCTGGATAT GGCCAATCAC
ATTGCGCAGT TTCTCCTTGG CCACCCCAAT CTGCCGAAAG ACAAGATCCC TTACTGGGAT
TTTAATGCAG CAGGTATTCC CAATGCACCC AGAGATGCAT CTGCAGGTGC AGTGATCTGT
TCAGCTCTGA TCGAACTGGC GGGCTACGCC GGCCCTAAAA TGGCGAAAAC TTATTTTAGT
GCGGCAGAGA CCATGCTTGG GGCGTTGTCT TCTCCTGCCT ATCGTGCTGC AACAGGGGAA
AATGGCGGGT TTATTCTAAA ACATGGCGTT GGTAATTACC CCCGTAATGC AGATATAGAT
GTGCCCCTGA TTTACGCAGA TTATTATTAC ATCGAGGCCC TGTCAAGATA TCAGAAACTA
TAA
 
Protein sequence
MHYKLKTSCF VLLFMAFAAV SVNAQKRFKP NGNLLKAVKR GLNESALQYH FLMEQLPAGR 
FPVTYYSKEQ KMITSGSEPW VGGFYPGGLL YLYESTRDTA LYNEALRKLK LLEKEQFNKT
THDLGFMMYC SFGNARRLMH TSAYDQIIIN SAKSLSSRYN DKVGCIRSWD SDAARFMVII
DNMVNLELLF AATKLTGDSS YYHIAVKHAN TTMKHHYRAD YSSYHLVIYN PETGAVSKKQ
TVQGAADTSA WARGQAWGLY GYTVMYRETK DKKYLDMANH IAQFLLGHPN LPKDKIPYWD
FNAAGIPNAP RDASAGAVIC SALIELAGYA GPKMAKTYFS AAETMLGALS SPAYRAATGE
NGGFILKHGV GNYPRNADID VPLIYADYYY IEALSRYQKL