Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0715 |
Symbol | |
ID | 4268104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 798996 |
End bp | 799901 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638125464 |
Product | HtpX domain-containing protein |
Protein accession | YP_741559 |
Protein GI | 114319876 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.931885 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCA TAGGGCTCTT TCTGCTCACT AACCTGGCGA TCCTGGTGGT CCTGGGTGTG GTGTTGTTCA TCCTGCAGGC CGTGTTCGGG GTGCGGACCC TGGATGAGGC CGGTGTGGGT CTGGACTATA CGGGGTTGCT CATCATCGCC GCGGTGATCG GTTTCGGCGG CTCGTTCATC TCCCTGGCCA TGTCCAAGTT CATCGCCAAG CGGATGACCG GGGCCCGCGT TATCGAAAAG CCGCGCAGCG AGGCCGAGCA GTGGCTGGTG GATACCGTGC GCCGCTTTGC CCGCCAGGAG GGCATCGGTA TGCCGGAGGT GGCGATCTAC GACGCCCCGG ACATGAACGC CTTTGCCACC GGGGCCCGGC GCAATAACTC CCTGGTGGCG GTGAGCACCG GCCTGCTGCA GAGCATGACC CGGGACGAGG CCGAGGCGGT GATCGGCCAC GAGATCGCCC ACATCAGCAA CGGCGATATG GTCACCCTGA CCCTGATCCA GGGGGTGGTG AACACCTTTG TGGTCTTCTT CTCGCGCATC ATCGGCCATT TCGTCGACCG CGTGGTATTC AAGACCGAGC AAGGGCACGG CCCGGCCTAT TTCATCACCT CCATCTTCGC CCAGATCGTG CTGGGCATCC TCGCCTCGGT GATCGTCATG TGGTTCTCCC GTCAGCGGGA GTACCGGGCC GATGCCGGTG GCGCCAAGCT GGCCGGGCGC GACAAGATGA TAGCCGCGCT GGAGCGGCTA AAGCGCTCGG TGGACCAGGA GCACCTGCCG GACCAGCTGG AGGCCTTCGG CATCAACGGC AACCGCGGTG GCGGCATGAA GGAGTGGTTC ATGTCCCACC CGCCGCTGGA CGACCGCATC GCCGCGCTGA AGGAGGGCCG TCACCTGCGC GGATGA
|
Protein sequence | MKRIGLFLLT NLAILVVLGV VLFILQAVFG VRTLDEAGVG LDYTGLLIIA AVIGFGGSFI SLAMSKFIAK RMTGARVIEK PRSEAEQWLV DTVRRFARQE GIGMPEVAIY DAPDMNAFAT GARRNNSLVA VSTGLLQSMT RDEAEAVIGH EIAHISNGDM VTLTLIQGVV NTFVVFFSRI IGHFVDRVVF KTEQGHGPAY FITSIFAQIV LGILASVIVM WFSRQREYRA DAGGAKLAGR DKMIAALERL KRSVDQEHLP DQLEAFGING NRGGGMKEWF MSHPPLDDRI AALKEGRHLR G
|
| |