Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_0776 |
Symbol | |
ID | 4795314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 757420 |
End bp | 759318 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640099438 |
Product | ATP-dependent protease Lon |
Protein accession | YP_001030214 |
Protein GI | 124485598 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000293467 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000000000496506 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | GTGTATGCCT CTGACGGATA TGATGCCGAC CTTTTCGGCG GCGTTCGGTT TGACACCACG GCGGATCTCG TAATCCCCCC GTCCCTGATC GATCAGGTCA TCGGTCAGGA ACACGCAGTG GATGTTATCA GAAAAGCCGC CACTCAGCGC AGGCACGTCA TGATGATCGG AAGTCCCGGT ACCGGCAAGT CGATGCTTGC TAAGGCCATG TCCGAACTCC TCCCGAAAGA GGAAATGCAG GACATCCTGA CGTACCCGAA TCCTGAGGAT AACAATAATC CGATCATCCG GGTGGTTCCT GCAGGAAGAG GAAAAGAGAT CGTCGCGGCC CACAAAGAAG AGGCCCGCAA ACGTGCCTCC TCGAAGAACA CGATGCTTCT CATCCTCGTC ATTGGTATTC TTGGGATCGC TCTGATTTCA GGCCAGCTTC TGATGGGTAT CGTCGCCGTG GCCTTTATCT TCATGGCATT TCGTTCATTC ATGCCCAAAG AAACTGCGAT GGTTCCAAAA CTCATCGTCT CCAACAAACC AGACTCGACC GCACCTTTCG TGGACGGGAC CGGATCCCAC GCCGGAGCAC TGCTCGGAGA CGTTCGCCAC GACCCGTTCC AGTCGGGAGG TCTCGAGACC CCTGCCCACG ACCGTGTAGA GGCCGGAGCA ATTCACCGGG CACACAAGGG TGTTTTGTTC ATTGATGAAA TGAACACCCT TGAACTCTCT TCCCAACAGA GTCTGTTGAC GGCACTTCAG GAAGGCGAGT TCCCGATCAC CGGTCAGTCC GAGCGTTCAT CTGGAGCCAT GGTCAGAACC GAACCGGTTC CATGCCGGTT CCTGATGATC GCAGCAGGAA ATCTCGATGC TGTCCAGCAT ATGCACCCTG CACTTAGGAG CCGTATTCGC GGATACGGAT ACGAGGTCTA CATGAGCGAG ACCATGAATG ATACGCCTGA AAACCGGGCA AAGCTCGTCA GGTTCGTAGC TCAGGAAGTT AAAAACGATG GTAAGATCCC GCATTACGAC CCATCGGCGG TATCCGAGAT CCTTCGCGAG GCAAAACGCC GGTCCGGCAG AAAAGGCCAC CTGACAGTGA AGCTGAGAGA TCTCGGCGGT CTTGTCCGTG TCGCCGGTGA TCTCGCCATT CAGGAAGGGT CCCCGGTCAC ATCGGTCCAT CATGTTGTCT CGGCAAAACA GATCGCAAGA TCCGTTGAGG ATCAGATATC CGACGAGTAC ATCCGTCGGA CCCGGGATTA TGATCTGACG ATCGTTTCCG GCAATCTTGT CGGAAGGGTG AACGGACTTG CCGTTGTCGG CAACGATGCA GGATCGGTCC TTCCGATCAC AGCCGAGGTT ACCCCCTCGC AGGGCGCAGG GATGGTTATT GCGACCGGTC TTTTGAAAGA AATCGCCCAG GAGTCAATCA AGAACGTGAG CGCCTTAATC AAGAAGTTCT CCGGAACTGA CATCCGCAAA GTCGATATCC ATGTCCAGTT CATTGGAACA TACAATGGGG TAGAGGGAGA TTCGGCATCC GTGACCGTCG CGACGGCGGT AATTAGTGCG CTTGAAGACA TTCCGGTCCG GCAGGATGTG GCGATGACCG GATCCCTGTC GGTAAGAGGA GATGTTCTTC CGATCGGCGG TGTCACCTAC AAGATCGAGG CAGCCGCAAA AGCCGGGATC CGCACGATCA TTATCCCGCA GTCGAACCTT GCCGATGTTC TTATCGAAGA GCGGTATTCC GATATGGTTT CCATCATTCC GGTGACCAGA ATCGAAGAAG TGCTCAGATA TGCACTCGTT CCCGAGGACA AGGAGGCATT CGAACAGAAA CTCCTCCAGA TCGGCAAACA TATGGATATC CCGAAAATGC CGATGCCCGC CGACAACGTT GCAGCGTGA
|
Protein sequence | MYASDGYDAD LFGGVRFDTT ADLVIPPSLI DQVIGQEHAV DVIRKAATQR RHVMMIGSPG TGKSMLAKAM SELLPKEEMQ DILTYPNPED NNNPIIRVVP AGRGKEIVAA HKEEARKRAS SKNTMLLILV IGILGIALIS GQLLMGIVAV AFIFMAFRSF MPKETAMVPK LIVSNKPDST APFVDGTGSH AGALLGDVRH DPFQSGGLET PAHDRVEAGA IHRAHKGVLF IDEMNTLELS SQQSLLTALQ EGEFPITGQS ERSSGAMVRT EPVPCRFLMI AAGNLDAVQH MHPALRSRIR GYGYEVYMSE TMNDTPENRA KLVRFVAQEV KNDGKIPHYD PSAVSEILRE AKRRSGRKGH LTVKLRDLGG LVRVAGDLAI QEGSPVTSVH HVVSAKQIAR SVEDQISDEY IRRTRDYDLT IVSGNLVGRV NGLAVVGNDA GSVLPITAEV TPSQGAGMVI ATGLLKEIAQ ESIKNVSALI KKFSGTDIRK VDIHVQFIGT YNGVEGDSAS VTVATAVISA LEDIPVRQDV AMTGSLSVRG DVLPIGGVTY KIEAAAKAGI RTIIIPQSNL ADVLIEERYS DMVSIIPVTR IEEVLRYALV PEDKEAFEQK LLQIGKHMDI PKMPMPADNV AA
|
| |