Gene Information |
Locus tag | Ent638_3670 |
Symbol | |
ID | 5111918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 3977541 |
End bp | 3978650 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640493875 |
Product | serine endoprotease |
Protein accession | YP_001178378 |
Protein GI | 146313304 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine peptidase DegS |
|
|
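The coordinate and length fields above are related by simple arithmetic, which can be cross-checked with a minimal sketch like the one below. It assumes 1-based inclusive coordinates and that the gene length includes the stop codon; both are assumptions that happen to be consistent with the values in this record, not documented properties of the database. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(3977541, 3978650)
length_aa = protein_length(length_bp)
print(length_bp)  # 1110
print(length_aa)  # 369
```

Both results match the Gene Length (1110 bp) and Protein Length (369 aa) fields listed in the record.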
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.932831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.311262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATGCT GCATCCGTCC CTTAATTAAC GACGCCCGCA TCATGTTTCT TAAGCTATTT CGTTCAGTTG CTATCGGTTT GGTCGTTGGC GGCCTGTTAT TGGCTGCAAT GCCCTCTTTA CGTCAGTTAA ATCAGCTGGC GGCACCGCGT TTCGACAGCA CCGATGAGAC GCCTGTGAGC TACAACCAGG CGGTTCGCCG TGCTGCGCCT GCTGTGGTTA ACGTCTATAA CCGCGGCCTC AACAGTTCCG CGCATAATCA ACTTGAGATT CGTACTCTCG GCTCCGGCGT GATCATGGAT GAGCGCGGCT ACATCATTAC CAACAAACAC GTCATTAACG ACGCCGATCA GATTATCGTC GCCCTGCAGG ATGGCCGCGT GTTTGAAGCC TTATTGGTTG GCTCGGACAC GCTGACCGAT CTCGCCGTCC TCAAAATCAA TGCCACGGGC GGGCTGCCGG TGATTCCGAT CAACCGTAAA CGCACGCCGC ATATCGGCGA TGTGGTGATG GCAATTGGTA ACCCGTATAA CCTTGGGCAA ACCATTACCC AGGGGATTAT CAGCGCCACC GGTCGTATCG GTTTGAATCC CTCCGGGCGA CAAAACTTCC TGCAAACGGA CGCCTCCATT AACCACGGTA ACTCCGGCGG CGCGCTGGTG AATTCGCTGG GAGAATTAAT GGGCATCAAT ACCCTGTCGT TCGATAAAAG TAACGATGGC GAAACGCCTG AGGGCATCGG TTTTGCGATC CCGTTCCAGC TGGCCAACAA AATTATGGAC AAGCTGATTC GCGATGGTCG CGTCATTCGC GGCTATATTG GCATCGGTGG CCGAGAAATT GCGCCAATGC ACACTCAGGG CGGCGGCATC GATCAGATTC AGGGTATCGT GGTCAATGAA GTGACACCGG GTGGTCCGGC GGCTAACGCG GGGCTACAGG TGAACGACGT GATTGTTTCG GTCAATGGCA CGCCTGCCGT ATCCGCACTA GAAACAATGG ATCAGGTAGC AGAGATTCGT CCTGGCTCGA TTATTCCGGT CGAAGTCATG CGTAATGACA AAAAACTGAC GCTTCAGGTG ACGATTCAGG AATATCCCGC CACTAACTAA
|
Protein sequence | MLCCIRPLIN DARIMFLKLF RSVAIGLVVG GLLLAAMPSL RQLNQLAAPR FDSTDETPVS YNQAVRRAAP AVVNVYNRGL NSSAHNQLEI RTLGSGVIMD ERGYIITNKH VINDADQIIV ALQDGRVFEA LLVGSDTLTD LAVLKINATG GLPVIPINRK RTPHIGDVVM AIGNPYNLGQ TITQGIISAT GRIGLNPSGR QNFLQTDASI NHGNSGGALV NSLGELMGIN TLSFDKSNDG ETPEGIGFAI PFQLANKIMD KLIRDGRVIR GYIGIGGREI APMHTQGGGI DQIQGIVVNE VTPGGPAANA GLQVNDVIVS VNGTPAVSAL ETMDQVAEIR PGSIIPVEVM RNDKKLTLQV TIQEYPATN
|
| |
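The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute GC content as a percentage. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the whole-gene GC value reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
example_blocks = "ATGCTATGCT GCATCCGTCC"
example_seq = clean_sequence(example_blocks)
print(len(example_seq))          # 20
print(gc_percent(example_seq))   # 55.0
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` would give a figure to compare with the GC content field.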