Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1900 |
Symbol | |
ID | 8253004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2194384 |
End bp | 2195625 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644935551 |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_003092170 |
Protein GI | 255531798 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.274558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA ACTACAAATT ACGCTTAGTT GTTGCAGTTT TAATGTGCGT GGCTACATCC TGCAAAAAGG GCAACAAAAC GCCAGACACC GGAAAAGATT TAATATTGAC CGAAAGAGAG CAGCAAAAGG CAGAAGCCGA TAATGCCTTT ACCTTAAAGC TGTTTAAGGA AATTATAGCC AAACCTTTGG CGGGTAAAAA CCTTATGTTG TCGCCTTTAA GTGTAAGTAT AGCATTGGGG ATGACCAGTA ACGGAAGCAG CGGCACAACT TTGGAAGCCA TAAGAAATAC CATGGAATTT AAGGATTTTA CTGAAGCTGA AATAAACAGC TATTACCATA AAATAGCAAC AGAATTACCC CAGTTAGATC CTAAGGCTTC ATTAAAAATT GCCAATTCCA TCTGGTACAG AAATACATTT ACCACATTGC CTGCTTTTCT TAATGTGAAC AGGGACAATT ACAATGCTGC AGTTGAGGGC CTGGACTTTG CCAATCCTGC TGCAAAAGAC AAAATTAACA ATTGGGTAAA TAACAGCACA AATGGAAAAA TCCCAACTAT AATTGATGCA ATTGGCAGCG ATATGGTCAT GTACCTGATC AATGCCGTTT ATTTTAAAAG CGACTGGAAG TATAAATTCG ATAAGGATAA AACTGCAAAA TCGGATTTTA ACCTGGACGC CAATAATAAG GTACAAACAG ATTTTATGGT TGCAAAGGCA ACTGTTAATC ACTTACGCTC AGAGGAGGCC TACATTTATG AACTTCCGTA TGGGAACGAA AAATACAGCA TGGTCATTGC ATTACCTGCT ACCAATACCA ATATTGCTGA ATTTGTCGCC TCAGTTAGTC CGGCAAAATG GAAAGGGTGG ATGGCAGGCC TGCAGAAAAC CGGTGTTGAA ATTAAAATGC CCAGGTTTAA ATTTAGCTAC AGCAGCATAT TAAACGATCA GCTAACCAAT CTGGGTATGG GAATTGCCTT TGGTAAAACC GGGGCAGCAG ATTTCAGCAG AATGAGTGCT GCAGGTTTAC AAATAAATGA GGTGAAGCAT AAAACCTTTG TTGAAGTAAA TGAAAGCGGT ACAGAAGCTG CCGCTGTAAC CTCTGTTGGG ATGGAGCTTA CTTCTGTACA AGAGCCGGCC CCGGTTCTTA TTAACCGCCC TTTTGTATTT GTGATCCGCG AGATGAAAAC CGGGCTGATT TTATTTACCG GTATCGTAAA TAACCCCTTA TTGGATAATT AA
|
Protein sequence | MKKNYKLRLV VAVLMCVATS CKKGNKTPDT GKDLILTERE QQKAEADNAF TLKLFKEIIA KPLAGKNLML SPLSVSIALG MTSNGSSGTT LEAIRNTMEF KDFTEAEINS YYHKIATELP QLDPKASLKI ANSIWYRNTF TTLPAFLNVN RDNYNAAVEG LDFANPAAKD KINNWVNNST NGKIPTIIDA IGSDMVMYLI NAVYFKSDWK YKFDKDKTAK SDFNLDANNK VQTDFMVAKA TVNHLRSEEA YIYELPYGNE KYSMVIALPA TNTNIAEFVA SVSPAKWKGW MAGLQKTGVE IKMPRFKFSY SSILNDQLTN LGMGIAFGKT GAADFSRMSA AGLQINEVKH KTFVEVNESG TEAAAVTSVG MELTSVQEPA PVLINRPFVF VIREMKTGLI LFTGIVNNPL LDN
|
| |