Gene Information |
Locus tag | Phep_0056 |
Symbol | |
ID | 8251140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 57372 |
End bp | 60188 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644933705 |
Product | peptidase M16 domain protein |
Protein accession | YP_003090344 |
Protein GI | 255529972 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
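The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, although both are consistent with the values listed. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1          # inclusive coordinate span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(57372, 60188)
print(gene_len, protein_len)  # 2817 938
```

Under these assumptions, 60188 - 57372 + 1 = 2817 bp, which is 939 codons, or 938 amino acids plus one stop codon, matching the Gene Length and Protein Length fields.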
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.456752 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATC ATTTTTTATT TTTTACTGTA ATGCTATTGT ATGCTGTACC GTTAATGGCA CAGCATACAA CAAAAACCAC CGCTAAGGCC GGATTGTTGC CGGTAGATAC GGCAGTTAAA ATCGGGAAGC TGCCCAATGG GCTTACTTAT TACATTCGTA AAAACAGCGA ACCGGCAAAA CGGGCTGTCC TGTACCTGGT TACACATGTC GGTTCCTTAA TGGAAGATGA TGACCAGCTG GGCCTTGCCC ACTTTACAGA ACATATGGCT TTTAACGGTA CAAGGGATTT CCCTAAAAAT GAACTGGTCA ATTACCTCCA AAAGGCCGGG GTAAAGTTTG GTGCGGATGT AAACGCTTCT ACTTCTTTTA ATGAAACGAT ATACAAGCTC CCTTTGCCAA CAGATAGTAT GGCCGTTTTT AAAAAGAGCT TCAGCATGAT GGCCAACTGG GCAGGACTGG TTACTTTTGA AGAGGGGGCA ATAAACAGGG AGCGCGGGAT CATTTTAGAA GAAGAACGCT CAAGGGGCAA AAATGTGGCA GAACGCATTC AGAAACAGGT AATACCTGCC TTGCTGAACA ACTCCAGGTA CGCCAGTCGC ATGCCAATTG GCAAAGATGA GCTGTTAAAA ACCTTTAAAC CTGATGTGAT CAAAAGGTTT TATAAGGACT GGTACCGGCC CGACCTTCAG GCCGTTATTG CCGTTGGTGA TTTTGACCCG AAGCTGGTAG AAAGGCTGAT CAGGGAAAAT TTCAGCACTT TAAAAAATCC GATGCACAGC AGGAAAAGAA TAAACTATTC CATTCCCCCT GATAAAGGCA CACAAATAAA AATCATTACA GACCCCGAGC AGACCAGTAC CAGGATGCAG ATCATTGTAC GGCACCAGGG TAAAGCGGTC AGAACCAGTG CCGATTTGAT AGAATCATTA AGCCGGAGCC TGCTGAACCG GATGTTGGGT AGCCGTGTTG CTGAACTGAG ACAGCAGGCC AATGCTCCCT TGCTGATCGG CTCTATTGGT TATGGCCGCT TTCTGGCTGA TATTGATGCC TTTACAATTG CGGTAACCGC CAAGCCGGGA CAGTTGGAAC AGGCAGCAAA AAAAATACTT GCGGTTAATG AACAGGCCAG GCAATTCGGG TTTACCCAGG CTGAACTGGA AAGTGCAAAG AAATCGCTTT TCAATACCAT AGAAAGACAA TGGAAAGAAC GCGACAAGAA CAGTTCTGCC GCTTATGTGA CCGAATACCT GAATCACTTT ACAAAGGGAG AAGCCATTCC CGGCATCGAT TATGAGTATC ATTTCCTGAA AAATAACCTC GCTAAGATTA AGCTTGAGCA ACTGGATAAA CTGATCCGCG CTTTGAACAG CACAGAGAAC CGGGTAATTA TTATAGAAGC GCCGGAAAAG GACAAAGCCC TGCTTCCTGA TCAAAAAAAA CTGCTTTCCT GGATCAGTGA TTCCGGCAGG AACCTGCAGC CCTACCGGAG CGAGGCCGTT CCATCAAAAC TGCTGGACAA CCTACCGGAG GGCGGTGTTA TAAGTGCGGC AAAAACCAAT GTCGGCACCG GGGTTTCCGA ACTCATTTTA GGCAATGGTG CCCGGGTAAT ACTCAAACCG ACACGTTTTA AAAATGACCA GATCATCATC AATGGATACA GTTATGGCGG TACATCTATT GCCGCAGATT CTATATATCA TTCAGCCGCG CTGGCAGCAT TACTGGTAAA CCGGAGTGGC CTGGGCAAGC TGACACGGGC ACAGCTGAAT AAAATGCTGA GCGGCCGTTC GGTAAATCTT 
TCGGCCTCCA TCAGCGACTT TACAGAAGGT CTGAGTGGAA GTGCCTCGCC GGCTGAACTG GAAACAGCAC TGCAATTGAT ATACCTGTAT TTTACACAGC CCCGCAAAGA TCCTGAGGCC TGGTTGGCCA TCATTTCACA ACAGGAAGCC AGTATGGCCA ACCGAAGTGC AAGTCCTAAC CTGGTTTTTC AGGATACCGT TACAGCTGTT TTAAACAGCT ATAACCCCAG AAAAACAGCG GGAAACCTGG ATGGTACATC GATAGACCAG GCTTATCGTT TCTATCAGGA TCGTTTTGCT GATGCCAGTG ATTTTACTTT TATACTGGTG GGGGCGATAG ATACGGCAAA GGCCATTCCC CTGATCAAAA AATACCTGGG AAACCTGCCG GCCATCCACA GAAAAGAGCA CTACAAAGAC GCTGGCTTTG GCACGCCCAG GGGCCAGGTA AGCAAAACCG TTTATAAGGG TATCGAAGCA AAAAGCAGGG TGCAGCTGGT ATATAGCGGC ACTTATACCT ACAATGACCT GAACAACATT CAGTTGGAAG CCTTAAAGGA AATGATCAAT TACAGGATCC TGAACCGTTT AAGGGCAAAA GAAAGCGGTG TATATACCCC ATCTGTAAAT GTGGGTTATG GCAATATCCC CGTTCAAAGA TACAGCATCA CCATCAGCTT TAACTGTGCC CCCGAAAATG TACAGCATTT AATTGAGGCC AGCAAAGAAG AAGTGGAGCG CTTAAAAAGA GAAGGGCCCG AACCGGCAGA AATACAGAAG TTTATGGCAG CGCAACTGCG CATGCGCGAA ACGCAGGTGG AAAGCAATAC CTGGTGGGTA TATTACCTGA GAAACCAGTA CATGAACAGG GATAAACCCG AAACAGAACC GTCCTACAAT AAGCTTCTTG GTGAAGTAAG CCCTGAAAGT GTCCGCACTG CTGCGCGGCA ATATGCCGGT GGCGAAAATC TGGCCGAATT TATCTTAATG CCGGAAAAGA AAGGCCCCCG CCATTAG
|
Protein sequence | MKNHFLFFTV MLLYAVPLMA QHTTKTTAKA GLLPVDTAVK IGKLPNGLTY YIRKNSEPAK RAVLYLVTHV GSLMEDDDQL GLAHFTEHMA FNGTRDFPKN ELVNYLQKAG VKFGADVNAS TSFNETIYKL PLPTDSMAVF KKSFSMMANW AGLVTFEEGA INRERGIILE EERSRGKNVA ERIQKQVIPA LLNNSRYASR MPIGKDELLK TFKPDVIKRF YKDWYRPDLQ AVIAVGDFDP KLVERLIREN FSTLKNPMHS RKRINYSIPP DKGTQIKIIT DPEQTSTRMQ IIVRHQGKAV RTSADLIESL SRSLLNRMLG SRVAELRQQA NAPLLIGSIG YGRFLADIDA FTIAVTAKPG QLEQAAKKIL AVNEQARQFG FTQAELESAK KSLFNTIERQ WKERDKNSSA AYVTEYLNHF TKGEAIPGID YEYHFLKNNL AKIKLEQLDK LIRALNSTEN RVIIIEAPEK DKALLPDQKK LLSWISDSGR NLQPYRSEAV PSKLLDNLPE GGVISAAKTN VGTGVSELIL GNGARVILKP TRFKNDQIII NGYSYGGTSI AADSIYHSAA LAALLVNRSG LGKLTRAQLN KMLSGRSVNL SASISDFTEG LSGSASPAEL ETALQLIYLY FTQPRKDPEA WLAIISQQEA SMANRSASPN LVFQDTVTAV LNSYNPRKTA GNLDGTSIDQ AYRFYQDRFA DASDFTFILV GAIDTAKAIP LIKKYLGNLP AIHRKEHYKD AGFGTPRGQV SKTVYKGIEA KSRVQLVYSG TYTYNDLNNI QLEALKEMIN YRILNRLRAK ESGVYTPSVN VGYGNIPVQR YSITISFNCA PENVQHLIEA SKEEVERLKR EGPEPAEIQK FMAAQLRMRE TQVESNTWWV YYLRNQYMNR DKPETEPSYN KLLGEVSPES VRTAARQYAG GENLAEFILM PEKKGPRH
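The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. This is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts), it assumes the gene sequence is already shown in coding orientation even though the gene lies on the minus strand (the sequence begins with ATG, which supports this), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGAAGAATC ATTTTTTATT TTTTACTGTA"))  # MKNHFLFFTV
```

The first ten residues produced this way agree with the start of the protein sequence, and the final codons of the gene (CGC CAT TAG) translate to R, H, and a stop, agreeing with its last two residues.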