Gene Information |
Locus tag | Mlg_1368 |
Symbol | |
ID | 4268131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1564990 |
End bp | 1566177 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126124 |
Product | peptidase M50 |
Protein accession | YP_742207 |
Protein GI | 114320524 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
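The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1564990
end_bp = 1566177

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1188, matching "Gene Length"
print(protein_length)  # 395, matching "Protein Length"
```

Under these assumptions the reported gene length (1188 bp) and protein length (395 aa) agree with the coordinates.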
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00624726 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGGG CACAGCGGGC GGTAAGATCG GGGCATGGAG CCGAGGCCGG GAGTGTGATC ATGTTCAAAT CAGTGCTGGT GCTGGGGTAC TACCGGGGCA TCCGGCTGGA GGTGCACGTC AGCTGGCTGG TCATCTTTGC CCTGCTTCTG GTCACCATGA GTGCCGGGTT TCACCACCAC TACGATCATT GGCCGCTGCC GGTAGCCATA CTCACGGCGC TGTTCACGTC GCTCACCTTT TTCGCCTCCA TCGTCGCCCA CGAACTCGGC CACAGCCTGG TGGCCATCCG TCGTGGGGTC CCGGTCAAGG CCATCACCCT GTTCATCTTC GGCGGGGTGG CCCAGATGAG CCGCGATGCC GACAGCCCCG ATGATGAGTT CTGGATCGCC ATTGCCGGAC CCGCGGTCAG TTTTGCCCTG GCGCTGCTTT TCGCCGCCCT GGCCCAGATC ACGGCGGGGA TTTTTGAGCC GCTGACCGTG GCCCTGGGCT GGTTGGCGGT GATCAACCTG GTGGTGGCCG TGTTCAACCT CATCCCCGGC TTCCCGCTGG ATGGCGGACG GGTCTTCCGC GCCGCGGTCT GGAAGTTCAC CGGCAGCGCG CGCAAGGGGA TCGAGGCCGC CGTGGCGGGT GGCCGGCTGG TCGCCTACGG GCTGTTTGCC CTGGCCTTGT GGAACATCCT GGTGCTGGGC AACCTAATCG GGGGGTTGTG GATCACCCTG ATCGCCTGGT TCCTGTTCAA TATGGCCCAG GCCCAGGGGC GAATGTTCGA CCTGCGCGAG CGCCTTTCCG GGGTGCGGGC CCGCGATCTG GCGCGGCCCG ACATCCCCCA GGTCGAGCCC GGGACAGCCG TCAGTGACTG GGTCCATCAC CAGGTGCTGC CGGGGGGGCA GCGCGCCCAT ATCGTTGGCA ATCGCGAGCA CGCCCATGGG CTGGTCTCTC TCTCCGATGC CCGGGCGGTG CCACAGGCGC AGTGGGCCAC CACCCGCGTC GACGACATCA TGACCCCGGC GGAGGCCCTG GTCAGTGCGA CGCCGGAGAC CGACGCGGCC CAGGTCCTAC AACTGATCAC CGAGCACAAC CTCAATCAGC TCCCGGTGAT GGAGGGGCGC CGCGTGTTGG GTTGGATCGA CCGCCATCAA CTGCTGCATA CCATCGATCT GCACATGGAG CTGAGGCGGC CGGAGTGA
|
Protein sequence | MRRAQRAVRS GHGAEAGSVI MFKSVLVLGY YRGIRLEVHV SWLVIFALLL VTMSAGFHHH YDHWPLPVAI LTALFTSLTF FASIVAHELG HSLVAIRRGV PVKAITLFIF GGVAQMSRDA DSPDDEFWIA IAGPAVSFAL ALLFAALAQI TAGIFEPLTV ALGWLAVINL VVAVFNLIPG FPLDGGRVFR AAVWKFTGSA RKGIEAAVAG GRLVAYGLFA LALWNILVLG NLIGGLWITL IAWFLFNMAQ AQGRMFDLRE RLSGVRARDL ARPDIPQVEP GTAVSDWVHH QVLPGGQRAH IVGNREHAHG LVSLSDARAV PQAQWATTRV DDIMTPAEAL VSATPETDAA QVLQLITEHN LNQLPVMEGR RVLGWIDRHQ LLHTIDLHME LRRPE
|
| |
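The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on two assumptions: that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted), and that the gene sequence is listed in coding orientation even though the gene lies on the minus strand, which is consistent with it beginning with ATG. The function name `translate` is hypothetical, and only the first 30 nucleotides of the record are used here.

```python
# Build the codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(
        (a + b + c for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
)


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base groups used in the record
    can be pasted in unchanged.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
gene_prefix = "ATGAGGCGGG CACAGCGGGC GGTAAGATCG"
print(translate(gene_prefix))  # MRRAQRAVRS
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to the same function should reproduce the whole protein, provided the assumptions above hold.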