Gene Phep_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0636 
Symbol 
ID8251724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp744039 
End bp745154 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content42% 
IMG OID644934285 
Productpeptidase M42 family protein 
Protein accessionYP_003090920 
Protein GI255530548 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAA AGAAAGACGA CAAACAGAAG CATGTTGCTG TAGTTACTAA AAAATCGCTC 
CAGTTTTTTG AAGAATATAT CAACAATCCT TCTCCTACAG GCTTTGAATA TCCAGGCCAG
AAATTATGGC TGGATTATTT AAAACCTTAT ATAGATGAAA GCTTTATAGA CAACTATGGC
ACTGCTGTGG GCATAATTAA TCCTAAAGCT GAATATAGGG TAGTTATCGA AGCACATGCA
GACGAAATCT CCTGGTTTGT AAATTACATC ACTGCTGATG GCCTTATTTA CGTCATCCGT
AATGGTGGCT CTGATCATCA GATTGCACCC TCTAAACGGG TTAATATCCA TACAGATAAA
GGAATGGTTA AAGCGGTGTT TGGCTGGCCT GCAATACATA CCCGTAATGG CGGCGACAAA
GAAGAGGCGC CTGCACTGAA AAATATATTC CTAGATTGTG GCTGCACCAG CAAGGAAGAA
GTAGAAAAAC TGGGTATCCA TGTAGGATGT GTTATCACTT ACGAAGACGA GTTCATGACA
TTGAACGACC GTTATTATGT GGGCAGGGCT TTAGATAACC GTGCAGGTGG GTTTATGATT
GCCGAAGTTG CCCGTTTACT TAAAGAGAAT AAGCAAAAAT TACCCTTTGG TCTCTATATC
GTAAACGCGG TACAGGAAGA GATCGGATTA CGGGGAGCGG AAATGATTGC TGATAGCATT
AAGCCACATG TTGCAATTAT TACCGATGTA ACACATGATA CCACAACCCC AATGATCAAT
AAAATTACAC AGGGCGACCT GGCCTGTGGT AAAGGACCTG TTGTTTCTTA TGCACCTGCT
GTTCAAACCA ATCTGAACAA ATTACTGATC GAGACCGCTG AGAAGAACAA TATTCCTTTT
CAACGCCAGG CTTCATCGCG CTCAACTGGT ACCGATACAG ATGCTTTTGC TTACTCTAAC
GGAGGTGTTC CTTCGGCTTT GATCTCACTA CCGCTCAGGT ATATGCATAC AACTGTTGAA
ATGATCCATA AAGAAGATGT AGACAATGTG ATCAGCCTGA TCTATCACTC GCTGTTGAAC
ATCAAAAAAG ACCATGATTT CAGGTACTCA AAGTAA
 
Protein sequence
MAKKKDDKQK HVAVVTKKSL QFFEEYINNP SPTGFEYPGQ KLWLDYLKPY IDESFIDNYG 
TAVGIINPKA EYRVVIEAHA DEISWFVNYI TADGLIYVIR NGGSDHQIAP SKRVNIHTDK
GMVKAVFGWP AIHTRNGGDK EEAPALKNIF LDCGCTSKEE VEKLGIHVGC VITYEDEFMT
LNDRYYVGRA LDNRAGGFMI AEVARLLKEN KQKLPFGLYI VNAVQEEIGL RGAEMIADSI
KPHVAIITDV THDTTTPMIN KITQGDLACG KGPVVSYAPA VQTNLNKLLI ETAEKNNIPF
QRQASSRSTG TDTDAFAYSN GGVPSALISL PLRYMHTTVE MIHKEDVDNV ISLIYHSLLN
IKKDHDFRYS K