Gene Information |
Locus tag | Phep_3779 |
Symbol | |
ID | 8254913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4532328 |
End bp | 4533908 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937443 |
Product | carboxyl-terminal protease |
Protein accession | YP_003094032 |
Protein GI | 255533660 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
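The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values listed but are not stated in the record. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4532328
end_bp = 4533908
gene_length_bp = 1581
protein_length_aa = 526


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming the length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq):
    """GC percentage of a nucleotide string; spaces between blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

The `gc_percent` helper could be applied to the gene sequence in the Sequence section to compare against the listed GC content; the result is not reproduced here.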
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA ATACCCGTTA TAATGTCTTA ATAGCCCTCA CTTATTCCGT TACATTGATT GGCGGAATGT TTTTTGGCTA TAAATTTTTA AAGGACCAGG GGTTTCAATT TCAAAAGCCG GTTCAGTTTG CTGATAGTAA CGCAGAAAAG GTAGATGAAA TTATCCACAT CATCAATAAA AATTATGTGG ATGAAATAAA TGCCGATTCA CTGACCCATT TGCCGATTGA TAGTTTACTG CATCAGCTTG ACCCGCACAG TATATACCTG CCCCCGGCTA AAGCAAACGA GATGGCGGAA ACATTGGGTG GTAATTTTGA AGGTATTGGT GTCGAATATT ATATATTGAA AGATACTTTG CTGATCACCA ATGTAGTAAA AGACGGGCCA GCATTTAACG CCGGCATCAG GCAGGGAGAT AAAATATTGA AGATCGATAC TGCTACAGTG AGTGGGAAAG CCCTGCCAAG GGATCAGATG ATCGGACGGA TAAGGGGCCG TAAAGGGACC GCGGTGAGAT TGACCATTGT GCATCCGGGT GATAACCAGC CAGTAGTGTT TACCGTAAAC CGGAACAGGG TAAAAGTAAG CAGTATTGAC GCTGCTTATA TGCTGAACCC CGAAACCGCT TACATCAGGA TCAGTAAGTT TGGTGCAGAT ACAGACAAGG ACTTTATTGA ATCGGTAAGA ACACTCAAGG TAAAAGGAAT GAAAAAACTG ATCCTTGACC TGAGAGATAA CGGGGGAGGA TATCTGAGCG CAGCAACAGG CCTTGCCAAC CAGATTTTGC CCGAAAATAA GCTAATTGTG TATACAGAGG GTAAACATGA ACCGCGGACA GATTATGTAG CTACCGGTGG AGGGGAGTTT GAACAGGGCA AACTTGCCGT GCTGATTAAT GAAAACTCTG CTTCGGCCAG TGAAATTCTT GCCGGTGCAG TACAGGACTG GGGTAGGGGA GTTATTATAG GGCGCCGTTC TTTTGGTAAA GGCCTGGTAC AGGAACAATT CCCTTTTGGG GATGGTTCTG CTTTAAACCT GACGATAGCC AGGTATTATA CCCCTTCGGG GAAAAGTATA CAAAAGTCTT ATAAAAAGGG CTACAACGCT TATCAGAATG AGATTGAAGA TCGGTTTAAT GATGGTGAGC TTACTTCAGA GACACTAACC GGAGCAAAGG ATAGTTTGCA ACGTAAAAAC TATACGCGCG GGGGTATACA GCCTGATGTT TACGTTAAAC TGGATACAAA TGGCTATAAC CGGTTTTACA GTAAACTGGT GGCTAAAAAG ATACTTTTCG ACTTTGTATA CGATGTATTG GGCAGCAGGT ACAATGCCGC ACAATTAGAA CAAAAAATGA ATGTATTTGC GATCACTGAG ACAGATTATA ATGATTTTTT GAAATATATC CAAAACCGCC ACATCCCGAT AGACTCAAAA CAATTGTATA TTGCTAAGCC GCTGATCTAT AACGACCTTA AATTGTTACT CTATAAATAT CACCTTGGTG ATGCCGGTTA TTATAAGGCG CTGAACCTAC ATGATCCGAT GGTAAAGCAA GCAGTTACGA GTTTGCAATA A
|
Protein sequence | MKKNTRYNVL IALTYSVTLI GGMFFGYKFL KDQGFQFQKP VQFADSNAEK VDEIIHIINK NYVDEINADS LTHLPIDSLL HQLDPHSIYL PPAKANEMAE TLGGNFEGIG VEYYILKDTL LITNVVKDGP AFNAGIRQGD KILKIDTATV SGKALPRDQM IGRIRGRKGT AVRLTIVHPG DNQPVVFTVN RNRVKVSSID AAYMLNPETA YIRISKFGAD TDKDFIESVR TLKVKGMKKL ILDLRDNGGG YLSAATGLAN QILPENKLIV YTEGKHEPRT DYVATGGGEF EQGKLAVLIN ENSASASEIL AGAVQDWGRG VIIGRRSFGK GLVQEQFPFG DGSALNLTIA RYYTPSGKSI QKSYKKGYNA YQNEIEDRFN DGELTSETLT GAKDSLQRKN YTRGGIQPDV YVKLDTNGYN RFYSKLVAKK ILFDFVYDVL GSRYNAAQLE QKMNVFAITE TDYNDFLKYI QNRHIPIDSK QLYIAKPLIY NDLKLLLYKY HLGDAGYYKA LNLHDPMVKQ AVTSLQ