Gene Phep_3734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3734 
Symbol 
ID8254866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4455123 
End bp4456913 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content43% 
IMG OID644937396 
Productpeptidase M61 domain protein 
Protein accessionYP_003093987 
Protein GI255533615 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.393831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAA CATCAATATT AGGATTGGTA ATATTACTAT TTGCAGGTAT GACAGCAAAA 
TCGCAGGTTA AAATAAGATT CGATATCAGT TTTACAGAGC CCCAGGCGCA CTATACGGAG
GTAGAAATGA ACATTTCAGG CCTGGTTAAA GATTATATAG ATATAAAAAT GCCGGTATGG
GCCCCGGGTT CGTACCTGGT GCGCGAATTT GCTAAAAGTG TGGAAGGTTT TGGGGCTACG
GCCAATGGCA AAATCCTTAA ACATGAAAAG GTGAGAAAAA ACACCTGGAG GGTGTATACC
GCTAAAGCGA ATGCCCTTAA AATTAAGTAC AGGGTATATG CTTTTGAAGT TTCGGTGCGT
ACATCTTTTA TTGATGAAAG CCATGCGTTT TTGTCGAGCA GCGGTATATT TATGTACCCT
GAGGGTTTGC TTAAAACACC GAGTACAGTA AAGATCAACC CTTATAAAGG CTGGACTAAA
GTATCTACCG GACTGGAACC GGTTTCGGGG CAGCAATTTA CCTATACCGC TGCCGATTTT
GACATTTTGT TCGACAGCCC TATTGAAGTA GGTAACCAGG ATGTTTTTGA ATTTATGGCC
TCGGGGCTTA AACATGAAGT GGCCATGTAC GGCGGCGGGA ATTATGATAA GGAGCGCCTT
AAAGTGGATA TGGCCAAAAT TGTGGAGCAG GGAACGGCCA TTTATGGCGA AAACCCGAAT
AAGCATTATA CTTTTATTGT CCATAATTTT TCTTCCGGTG GCGGCGGTCT GGAACACCTG
AACTCTACGG TACTGGGTGC AAAACGCGAT GCTTATGTTA CTGAAACCGG CTATAAAGGC
TTTTTAGAGC TGGTTGCGCA CGAGTATTAT CATTTATGGA ACGTAAAAAG GATGCGCCCT
GTGGCCTTAG GCCCCTTTGA TTACGACAAT GAGAACTATA CCACCAATTT ATGGATAGCA
GAGGGTTTTA CCGCCTATTA CGAAAATAAA CTGATGCTGC GTGCTGGTTT TACAGACGAG
AAAGGCTTTG TAGATGCACT GGTTACGGCA GTAAGCAATG TTTCCAATAC CCCAGGCAAC
AAGGTGCAGT CGGTTGCCGA AGCCAGTTAC GATGCCTGGA TCAAATATTA CAGGCCTAAT
GAAAATTCGA ACAACACTAC AGTTTCTTAT TATGCCAAGG GCGAGGTAGT AGGCTTATTG
ATGGACCTGG AAATAGCCCA TGCCACAAAA GGTACTAAAA GTCTGGACGA TGTGATGAAA
GCCATGTACC TGCAAAATAA AGCACAAAAA AGAGGGTATA CCGATGCAGA ATTTAAGGCT
ATGGTAGAAA AGATCAGCGG AGCCAGCTTT ACAGATTTCT GGGCTAAATA CGTGAATGGC
ACCACTGCAA TCGACTACCA TAAATATTTT GGCTATGCAG GTATCAACAT AACCAATGAA
AATGAAGGTA AAAGTATCCC GTATTTGGGC ATTGCCACTA AAAATCAGGG AGGAAGGGTC
TTTATTACCA CAGTTTCACG CAATTCGGCG GCCTGGGTTG ACGGGCTCAA TGTGAATGAT
GAAGTGATCA GTGCAGATGG TGCCCCTGTT GAAATAGCGA TTGATAAAAT GGCCGCAGTT
GCCGGTAAAA AAGTTGGAGA GACGGTAACT TTTAAGGTAG CCAGAGATGG GATCCTTAAA
GACATTACAG TTACCCTTAA AGCCAGTCCG AACCTGAAAC TTGTGGGGCA GATAGACGAA
AAGGCCACAG AACTGCAAAA GGCAGTAAGG AAGGCCGTAT TGTTTAAATA A
 
Protein sequence
MIKTSILGLV ILLFAGMTAK SQVKIRFDIS FTEPQAHYTE VEMNISGLVK DYIDIKMPVW 
APGSYLVREF AKSVEGFGAT ANGKILKHEK VRKNTWRVYT AKANALKIKY RVYAFEVSVR
TSFIDESHAF LSSSGIFMYP EGLLKTPSTV KINPYKGWTK VSTGLEPVSG QQFTYTAADF
DILFDSPIEV GNQDVFEFMA SGLKHEVAMY GGGNYDKERL KVDMAKIVEQ GTAIYGENPN
KHYTFIVHNF SSGGGGLEHL NSTVLGAKRD AYVTETGYKG FLELVAHEYY HLWNVKRMRP
VALGPFDYDN ENYTTNLWIA EGFTAYYENK LMLRAGFTDE KGFVDALVTA VSNVSNTPGN
KVQSVAEASY DAWIKYYRPN ENSNNTTVSY YAKGEVVGLL MDLEIAHATK GTKSLDDVMK
AMYLQNKAQK RGYTDAEFKA MVEKISGASF TDFWAKYVNG TTAIDYHKYF GYAGINITNE
NEGKSIPYLG IATKNQGGRV FITTVSRNSA AWVDGLNVND EVISADGAPV EIAIDKMAAV
AGKKVGETVT FKVARDGILK DITVTLKASP NLKLVGQIDE KATELQKAVR KAVLFK