Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2135 |
Symbol | |
ID | 4269875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2430812 |
End bp | 2431750 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126891 |
Product | peptidase U32 |
Protein accession | YP_742967 |
Protein GI | 114321284 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.449129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGGG AGCGAATGCC CATGCAAGTG ACCCTGGGGC CGGTGCCCTA TTACTGGCCG GCCGAGCGCC TGAGCGCCTT CTACCAGTCG GTGGCCGACT GGCCGGTGGA TACCGTTTAT CTGGGGGAGA CGGTGTGCAG CAAGCGCCGC GCCTTCAGCC TGGATGACTG GATCGGCTTG GGCCGGGAGC TGCGCCAGGC CGGTAAGGCG GTGGTGCTCT CCAGCCTGGC GCTGGTGGAG GCGCGTTCGG AGATGGGCGT GGTCCGGCGC CTGTGCGAGA ACGGCGAATT CCGGGTGGAG GCCAACGACA TGGGGGCGGT GCAGTTGGCG CGGGAGGCGG GTGCCGCCTT CGTCGCCGGG CCCACCCTGA ATGTCTACAA CCCCCGCACC CTGGCAGTGC TGCAATCGGC GGGGATGCGG GTCTGGGTGC CCCCGGTGGA GATGCCGGAG GAGATGGTCC GCGACAATGT TGACGGACTG GGGCAGGCGG CCGAGGGCCT GGAGGTCGAG GTTCAGGTCT TCGGCCCGCT ACCGCTGGCC TGGTCGGCGC GGTGCTTTAC CGCCCGGGCG GTGGACCGGC CCAAGGATCA ATGCGGCTTT CGCTGCCTGG AACACCCCAG CGGCCTGCCC ATGGCCACCC GGGACGGCGA TGAGTTCCTG CGTATCAACG GCATCCAACT GCAGTCCGGC CGCTGGCTGC ACCTGCTGGA CGCCCTGGAC TCCGTCCGCG ACGCGGGGGC CACCGCCATC CGTCTGAGTC CCTGGGGCGA GGATATGGCC GCGGTAGTGC ACGCCTACGC GGCGGTCGCC GCGGGCCGGC CGGCGCCGGC CCTGATCCCG GGACTGGACC CGGCGGCCTG TTGCGATGGC TACTGGTTCG GCGAGGCTGG CATGGCACGG CTGGCTCGCC AGGCACAAGA GCAGGAGGAA CAGGTATGA
|
Protein sequence | MMRERMPMQV TLGPVPYYWP AERLSAFYQS VADWPVDTVY LGETVCSKRR AFSLDDWIGL GRELRQAGKA VVLSSLALVE ARSEMGVVRR LCENGEFRVE ANDMGAVQLA REAGAAFVAG PTLNVYNPRT LAVLQSAGMR VWVPPVEMPE EMVRDNVDGL GQAAEGLEVE VQVFGPLPLA WSARCFTARA VDRPKDQCGF RCLEHPSGLP MATRDGDEFL RINGIQLQSG RWLHLLDALD SVRDAGATAI RLSPWGEDMA AVVHAYAAVA AGRPAPALIP GLDPAACCDG YWFGEAGMAR LARQAQEQEE QV
|
| |