Gene Phep_2694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2694 
Symbol 
ID8253802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3159218 
End bp3160369 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content44% 
IMG OID644936342 
Productglycosyl hydrolase family 88 
Protein accessionYP_003092957 
Protein GI255532585 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000585774 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00821463 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTTTTG CCGTTCCGGC TTTTGCACAA AAGGTTGATG TACAAAAAGC ATTTAAACAG 
GCGGATGCAC AGACGATGCT TATGCTGAAG GAAATTGAAA AGGTTAAAGC TACAACTAAA
AACGATCAGG TTTCGCCGAG AACCCTGGAG AAGGGTAATT TAAAACTGGT AGCCTCGCGC
GACTGGACCA GTGGCTTTTT TCCGGGTGAA CTTTGGTTTT TGTATGAATA TACAGGTAAG
CAAACCTGGA AGGACCAGGC CAGGGCTTTT ACGGCAGCTA TAGAACAGGA ACAGTTTAAT
GGCAAAACGC ATGATATGGG CTTTAAGGTA TATTGCAGTG TAGGTACGGG GTACCGTTTA
ACAAAAGATG AACACTACAA AACGGTGATC ATACAGTCGG CCAAAACCCT TGCAACCCGT
TTTAACCCTA AAACAGGCGT GATCCGCTCA TGGGACCACA GTACAGATAA ATGGGTAAAC
CCGGTGATCA TTGACAATAT GATGAACCTG GAACTGCTTT TTGAAGCCAC GAGGCTGACT
GGTGATTCTT CATTCTATAA AATTGCAGTG AGCCATGCCA ATACAACGAT GAAGAACCAT
TTTCGTGCCG ACTACAGTTC TTATCATGTA ATCGATTATG ATCCGAATAC AGGTGCGGTA
CTGAAAAAGA ATACCCATCA GGGCTATAGC CATGAATCGG CCTGGGCCAG GGGCCAGGCA
TGGGCCTTAT ATGGTTATAC CATGTGCTAT CGTTATACAA AAGACCCTGC CTATCTGAAA
CAGGCAGAGG GTGTTGCCGC TTTTATCCTT AACCACCCCA ATATGCCTGC AGATCTGGTG
CCTTACTGGG ATTTTAATGC ACCTGAAATT CCTAAAGAGC CACGTGACGC TTCTGCTGCT
GCGGTAATTG CTTCTGGTTT ATATGAACTG AGTGGTTACA GTACGCGTAA AAATCTTTAC
CTGCAAAAGG CGGATAAGAT GCTGAACAGC CTGAGTACAA AATATATTTC ACCTGCTGGC
GAAAACAAAG GTTTTATCCT TGTACAAAGT ACAGGATCCA AACCATCTGA CAGCGAGGTA
AATGTGCCGC TTTCTTATGC AGATTATTAT TACCTGGAAG CTTTGTTGCG GTACAAAGCA
ATAAAAAGAT AG
 
Protein sequence
MLFAVPAFAQ KVDVQKAFKQ ADAQTMLMLK EIEKVKATTK NDQVSPRTLE KGNLKLVASR 
DWTSGFFPGE LWFLYEYTGK QTWKDQARAF TAAIEQEQFN GKTHDMGFKV YCSVGTGYRL
TKDEHYKTVI IQSAKTLATR FNPKTGVIRS WDHSTDKWVN PVIIDNMMNL ELLFEATRLT
GDSSFYKIAV SHANTTMKNH FRADYSSYHV IDYDPNTGAV LKKNTHQGYS HESAWARGQA
WALYGYTMCY RYTKDPAYLK QAEGVAAFIL NHPNMPADLV PYWDFNAPEI PKEPRDASAA
AVIASGLYEL SGYSTRKNLY LQKADKMLNS LSTKYISPAG ENKGFILVQS TGSKPSDSEV
NVPLSYADYY YLEALLRYKA IKR