Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1343 |
Symbol | |
ID | 4270016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1541412 |
End bp | 1542857 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126096 |
Product | protease Do |
Protein accession | YP_742182 |
Protein GI | 114320499 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.139056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.641282 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGAC CGTCAAGCTA CGGATACCTG GCTGCGCTGC TGGCGCTGGC CCTGCTGACG TTTGGCGCCA GCGCCGGCGC CAAGTCGCAT CTGCCGGACT TCACCGAACT GGTTGAGGAG AACAGCCCGG CAGTGGTCAA TATCAGCACC CGCCAAACCC CCGAGAGCCG GGGCGGGAGC GGGCGTCTGC CGGACCATTT CGATATCCCG GAGGACCACC CGCTGCGTGA TTTCATGGAG CGGTTCTTTG GCGAGCGGGG TGAGCGGCCA CCCGAGCACG GCCAGCGCCG CCCGCGCTCT CTCGGGTCAG GTTTCATCAT CTCCGAAGAC GGGTATGTCC TGACCAATCA CCACGTCATC GACGGTGCGG ATGAGGTCAA CGTCCGGCTG AGTGACCGGC GCGAGTTCGT GGCCGAGGTC ATCGGCAGCG ATGAGCGCAG TGACGTGGCG GTGCTCAAGA TCGATGCCGA GGGGCTGCCC ACGGTGCGCA TCGGCCAGTC CGACACGTTG CGCGTCGGCG AATGGGTGTT GGCCATCGGC TCGCCCTTCG GTTTTGAGCA CTCGGCCACC GCCGGGATTG TCAGCGCCAA GGGGCGCAGC CTGCCCAGCG GCAACTATGT GCCCTATCTG CAGACCGATG TGGCGATTAA TCCCGGCAAT TCCGGCGGCC CGCTGTTCAA CCTGGACGGT GAGGTGGTCG GCATCAACTC CCAGATCTAT AGCCGCACCG GCGGTTTCAT GGGGGTCTCC TTCTCCATCC CCATCGAGCT GGCCATGGAC GTGGCCACCC AGCTGCGGGA GACCGGGCGT GTGGCTCGGG GCTGGCTCGG GGTGATCATC CAGGACGTCA CCCGGGACCT GGCCGAGTCG TTCGATATGG ACCGGCCGCG TGGCGCCCTG GTGGCCCAGG TGCTTTCGGA CAGCCCGGCC CTTGAGGCGG ATCTGCAACC CGGCGATATC ATCGTCGAGT TCGACGGCGA GGCGGTGGAG ACCTCCGGCA GCCTGCCGCC GATGGTGGGG GCTACGCCGG TGGGTGCGGA GGTGCAGGTA AAGGTGCTGC GCGAGGGCCG CGAGGTGATG GTTGATGTCA CCATCGGCGA GCTGCCCGAG GAGCAGGCGC GGGCCCAGCG CCCGCCCCGG GGTGAGCCGG AGCGTGCGCC GGATACCGCC GCGGAGCGAC TGGGGCTGCG GGTGGAGCCG GTGCCCGCCG AGCGCCTGGA GGAGCTGCGC GTCGACAGTG GTGTGCTGGT CCGGCGCGTG GAGAGCGGCC CGGCGCGGGA GGCCGGCATC CGCCCCGGTG ATGTGATCAC CTCCATCGAT CAGCAGTCCG TGGAAGGCGT GGAGCAATTC GCAGAGTTGG TCGAAGGCCT GGCCTCCGGG CGCAGTGTCC CGGTGCTGGT GCTGCGGGAG GGGGGGGCGC GTTTCTTCGC CCTGCGTATC CCCTGA
|
Protein sequence | MNRPSSYGYL AALLALALLT FGASAGAKSH LPDFTELVEE NSPAVVNIST RQTPESRGGS GRLPDHFDIP EDHPLRDFME RFFGERGERP PEHGQRRPRS LGSGFIISED GYVLTNHHVI DGADEVNVRL SDRREFVAEV IGSDERSDVA VLKIDAEGLP TVRIGQSDTL RVGEWVLAIG SPFGFEHSAT AGIVSAKGRS LPSGNYVPYL QTDVAINPGN SGGPLFNLDG EVVGINSQIY SRTGGFMGVS FSIPIELAMD VATQLRETGR VARGWLGVII QDVTRDLAES FDMDRPRGAL VAQVLSDSPA LEADLQPGDI IVEFDGEAVE TSGSLPPMVG ATPVGAEVQV KVLREGREVM VDVTIGELPE EQARAQRPPR GEPERAPDTA AERLGLRVEP VPAERLEELR VDSGVLVRRV ESGPAREAGI RPGDVITSID QQSVEGVEQF AELVEGLASG RSVPVLVLRE GGARFFALRI P
|
| |