Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1975 |
Symbol | |
ID | 4268518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2244278 |
End bp | 2246197 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638126731 |
Product | FtsH peptidase |
Protein accession | YP_742807 |
Protein GI | 114321124 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.182239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00765382 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCAAGA ATCTGATCCT CTGGATCATC ATCGCCGTGG TCCTGATGTC CGTGTTCAGC AACTTCCAGG AACGCGGTGC CCCGGGGCCG GAGCAGATGA AATACTCCGA GTTCCTCCGC CATGTGGAGG CCGGAGAGGT CCGTGAGGTG ATGATCCAGG GCGATAAGAT CACCGTGGAG CGCGCCGATG GCACGCGGGC CCACACCTAC AACCCGGAGT CGGACAACCG GGCGCTGATT GGCACCCTGC TGGAGCACGA CGTCACCATC GATGCCCGCG AGCCGGAAGG CCGCAACATG TTGGTGCAGA TTCTCATCTC CTGGTTCCCG TTCCTGCTGC TGATTGCCGT GTGGATCTAC TTCATGCGCC AGATGCAGGG GGGTGGCGCG GGTCGTGGGG CCATGTCCTT CGGTAAGAGC AAGGCCAAGC TCATGACCGA GGAGCAGAGC AAGGTCACCT TCAACGACGT GGCCGGCTGC GACGAGGCCA AGGACGACGT GGCGGAGCTG GTGGACTTCC TGCGGGATCC GTCCAAGTTC CAGCGCCTGG GCGGGACCAT CCCCCGCGGG GTGTTGATGG TCGGCCCCCC CGGCACCGGT AAAACGCTGC TCGCCAAGGC GATCGCCGGT GAGGCCAAGG TGCCGTTCTT CAGCATCTCC GGCTCCGATT TCGTCGAGAT GTTCGTCGGC GTGGGTGCTT CCCGCGTGCG CGATATGTTC AGCCAGGCCA AGAAGCACAG CCCCTGCATC ATCTTCATCG ACGAGATCGA CGCCGTGGGC CGCCAGCGCG GTGCCGGCCT GGGCGGGGGG CACGACGAGC GTGAGCAGAC CCTGAACCAG CTGCTGGTGG AGATGGACGG TTTCGAGGGC AATGAGGGCG TCATCGTCAT CGCCGCCACC AACCGCCCCG ACGTGCTCGA CCCGGCCCTG CTGCGCCCGG GCCGTTTTGA TCGCCAGGTG GTGGTGCCGT TGCCGGACGT ACGCGGCCGC GAGCACATCC TCAAGGTGCA CATGAAGAAG GTGCCCCTGG ACGACGATGT CACCCCCGCG ATCCTGGCCC GCGGCACCCC GGGCTTCTCG GGCGCTGACC TGGCCAACCT GGTCAACGAG GCCGCCCTGT TCGCCGCCCG CGCCAACAAG CGGGTGGTGG ACCAGGAGGA CTTCGAGAAG GCCAAGGACA AGATCATGAT GGGCTCCGAG CGTAAGTCCA TGGTGATGAA GGAGGATGAG AAGCGCCTGA CCGCCTACCA CGAGGCCGGC CACGCCATCG TCGGGTTGGT CACCCCGGAG CACGATCCGG TGCACAAGGT GACCATCATC CCCCGCGGCC GGGCCTTGGG CGTGACCATG TTCCTACCGG AAGAGGACCG TTACAGCTAC ACCAAGCAAC GCCTGAACAG CATGATCGCC AGCCTGTTCG GCGGCCGTAT CGCCGAGGAG CTGATCTTCG GCCACGAGCG GGTGACCACC GGCGCCTCCA ATGACATCCA GCGGGCCACC GAGATTGCCC GCAACATGGT CACCAAGTGG GGGCTGTCCG CCCGCCTGGG CCCCCTGGCC TACGGAGATG AGGAGGGCGA GGTCTTCCTC GGTCATTCGG TGACCCAGCA TAAGGACGTT TCGGAAGAGA CCCAGCACGC CATCGACGAG GAGGTGCGGG CCATCATCGA CGCCAACTAC ACGGCGGCGG AGAAGATCAT CCGCGAGCAC ATGGATCAGC TTCACGTGAT GGCCGATGCA CTGATGCGCT ACGAGACCAT CGATCGTGCT CAGATCGACG ACATCATGGC CGGGCGCGAG CCGCGGCCGC CGAAGGGCTG GTCCGGGCCT TCGGACGATG ACGATGACCG TGCCGAGGGC GAGAAGGTGG GTAAGGACGA GCGCGGCCGT ATTGGCGGCC CCGCCACCAG CGAACACTGA
|
Protein sequence | MAKNLILWII IAVVLMSVFS NFQERGAPGP EQMKYSEFLR HVEAGEVREV MIQGDKITVE RADGTRAHTY NPESDNRALI GTLLEHDVTI DAREPEGRNM LVQILISWFP FLLLIAVWIY FMRQMQGGGA GRGAMSFGKS KAKLMTEEQS KVTFNDVAGC DEAKDDVAEL VDFLRDPSKF QRLGGTIPRG VLMVGPPGTG KTLLAKAIAG EAKVPFFSIS GSDFVEMFVG VGASRVRDMF SQAKKHSPCI IFIDEIDAVG RQRGAGLGGG HDEREQTLNQ LLVEMDGFEG NEGVIVIAAT NRPDVLDPAL LRPGRFDRQV VVPLPDVRGR EHILKVHMKK VPLDDDVTPA ILARGTPGFS GADLANLVNE AALFAARANK RVVDQEDFEK AKDKIMMGSE RKSMVMKEDE KRLTAYHEAG HAIVGLVTPE HDPVHKVTII PRGRALGVTM FLPEEDRYSY TKQRLNSMIA SLFGGRIAEE LIFGHERVTT GASNDIQRAT EIARNMVTKW GLSARLGPLA YGDEEGEVFL GHSVTQHKDV SEETQHAIDE EVRAIIDANY TAAEKIIREH MDQLHVMADA LMRYETIDRA QIDDIMAGRE PRPPKGWSGP SDDDDDRAEG EKVGKDERGR IGGPATSEH
|
| |