Gene Phep_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3559 
Symbol 
ID8254680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4232986 
End bp4233987 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content41% 
IMG OID644937210 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_003093812 
Protein GI255533440 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.700407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.724049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGTAA TACTTGCTAT AGAATCTTCT TGCGATGAAA CTTCAGTTGC TATATGTAAC 
AACGGCAAAA TTACTGCCAA TGTTATTGCA AACCAAACAA TTCATGAAAA TTATGGTGGC
GTAATACCTG AACTTGCATC AAGGGTACAT CAACAAAATA TCGTTCCGGT TATACAACAG
GCATTAACTG ATGCTAAAGT AAGCAAAAAG GAATTAAGTG CCGTTGCATT TACCAGGGGA
CCAGGTCTTT TGGGGTCATT GCTGGTTGGT GTTTCATTTG CCAAATCATT TGCTTTGGCG
CTTGATTTGC CCTTAATAGC CGTTAACCAC ATGCATGCAC ACATTCTGGC ACATTTTATT
GATGATCCCA AACCTGCATT TCCTTTTTTA TGCCTTACGG TTTCGGGAGG GCATACCCAG
ATTGTATTGA TTAGGAGTTA TTTTGACATG GAGATCGTGG GGGAAACTCT TGATGATGCT
GCTGGCGAGG CTTTTGACAA GACTGCCAAA ATCCTGAATC TTCCTTATCC GGGCGGACCA
CTGATAGATA AACATGCAAA AGAAGGAAAT CCGCTGGCCT TTAAGTTCCC TGAACCTCAG
ATAAAAGATT TAAATTACAG TTTTAGTGGC TTAAAGACTG CTATCTTGTA TTTTATCAGG
GCGCAGGAAA AAGAAAATCC TGATTTTATT GCCGGCAATT TAAATGATAT CTGCGCATCT
GTACAACATA GTATTGTTGA CATTTTGCTC AATAAATTAA AAAAGGCGGC CCAGCAATAT
GGAATAAAAG AAATTGCAAT AGCCGGTGGG GTTTCGGCAA ACAGTGGCCT GCGGCATGCA
CTTCAAAAAA TGGCGGGACA GCAGGGTTGG AATGTTTATA TCCCCGCATT TCAGTATTGC
ACAGATAATG CTGCTATGAT TGCCATTGCA GGATATCATA AATATTTAAA CGGTGATTTT
GTTGGCCAGG ATGTGGCTCC ACTTTCACGA ATGGAATTTT AA
 
Protein sequence
MSVILAIESS CDETSVAICN NGKITANVIA NQTIHENYGG VIPELASRVH QQNIVPVIQQ 
ALTDAKVSKK ELSAVAFTRG PGLLGSLLVG VSFAKSFALA LDLPLIAVNH MHAHILAHFI
DDPKPAFPFL CLTVSGGHTQ IVLIRSYFDM EIVGETLDDA AGEAFDKTAK ILNLPYPGGP
LIDKHAKEGN PLAFKFPEPQ IKDLNYSFSG LKTAILYFIR AQEKENPDFI AGNLNDICAS
VQHSIVDILL NKLKKAAQQY GIKEIAIAGG VSANSGLRHA LQKMAGQQGW NVYIPAFQYC
TDNAAMIAIA GYHKYLNGDF VGQDVAPLSR MEF