Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2213 |
Symbol | |
ID | 4268685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2518981 |
End bp | 2520207 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126969 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_743045 |
Protein GI | 114321362 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family [TIGR02038] periplasmic serine pepetdase DegS |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0322293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTGC CGCGCCTGCT GCGATTCCTG CTCAGTTACA CCGCCATCGG CCTGCTGGTG GCGGCGGTCA TCATCGGCCT GCGCCCGGAC CTGGTGGGCA TGCAGACCGG TAACAACGCC GCGCCGAACG ACCGCAATGG CAACGGGGCG GCTGCGCCCG CCACCCAGAC GCTAGCGCCG CCGATACAGC CCCGCACGGG GCCGGTGTCC TACGCCGACG CCGTGGAGCA GGCCCAACCG GCCGTGGTCA ACATCTACAC CGCCAAGACC GTGCAGGAGG CGCCGCACCC ACTGTTCGAC GACCCCTTCT TCCGCCGCTT TTTCGGCGAT GTCGCGCCCC ACCGGCCGCG GGAGCGCACG CAGACCAGCC TGGGTTCGGG GGTGATCTTC AGCGAACAGG GCTACGTCAT CACCAATAAC CACGTCATCG AGGACGCCGA CCAGATCCAG GTGCTGCTGG CGGATGGCCG GGAGGCCCTG GCCAGTGTGG TGGGCCGCGA CCCGGAGACC GATCTCGCGG TGCTGCGCAT CGAACTGGAC CGGCTGCCGG TGATCCAGTT GGCGGACGAC CGGGCGCTGC GCGTGGGCGA CGTGGTGCTG GCCATCGGCA ACCCCTTCGG CGTCGGCCAG ACGGTGACCA TGGGCATTGT CAGCGCCACC GGCCGCGATC AGCTCGGCCT GACCACCTTC GAGAACTTTA TCCAGACCGA CGCGGCCATC AACCCGGGCA ACTCCGGCGG CGCACTGATC AACGCCGAGG GCCGGCTGGT GGGCATCAAC ACCGCCATCT TCAGCCGCAC CGGGGGCCAC CAGGGCATCG GCTTCGCCAT CCCGGCCCAC CTGGCGGTCT CGGTGCTGCA AAGCATCGTC GAGGAGGGTC GCGTGGTGCG CGGCTGGATC GGGGTCCAGG CCCAGAGCCT GACCCCCATG CTGGCCGAGT CCTTCGACCT GGCGGCGGCA CAGGGCATTG TCATCTCCGG CGTGTTGCGC GGGGGCCCGG CGGACCGTGC CGGCCTGCGC CCCGGGGATA TCATCACCCA CATCGAGGGC GAACCGGCGG CCGATGCCCA GGCGCTGCTG GAGCGGGTCA CCGACAAGCG GCCCGGGAGC GAGCTGCGGC TGGATCTGCT GCGGGATGGC GAGGCGCGCA CGGTCACCGT GGCGGTGGGA GAACGCCCGG CCCAGGACGA GCGGCAGCCG GCCCCACGGC AGCCGCGGTT ACCCTGA
|
Protein sequence | MKVPRLLRFL LSYTAIGLLV AAVIIGLRPD LVGMQTGNNA APNDRNGNGA AAPATQTLAP PIQPRTGPVS YADAVEQAQP AVVNIYTAKT VQEAPHPLFD DPFFRRFFGD VAPHRPRERT QTSLGSGVIF SEQGYVITNN HVIEDADQIQ VLLADGREAL ASVVGRDPET DLAVLRIELD RLPVIQLADD RALRVGDVVL AIGNPFGVGQ TVTMGIVSAT GRDQLGLTTF ENFIQTDAAI NPGNSGGALI NAEGRLVGIN TAIFSRTGGH QGIGFAIPAH LAVSVLQSIV EEGRVVRGWI GVQAQSLTPM LAESFDLAAA QGIVISGVLR GGPADRAGLR PGDIITHIEG EPAADAQALL ERVTDKRPGS ELRLDLLRDG EARTVTVAVG ERPAQDERQP APRQPRLP
|
| |