Gene Mlab_0776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0776 
Symbol 
ID4795314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp757420 
End bp759318 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content55% 
IMG OID640099438 
ProductATP-dependent protease Lon 
Protein accessionYP_001030214 
Protein GI124485598 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000293467 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000000496506 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGTATGCCT CTGACGGATA TGATGCCGAC CTTTTCGGCG GCGTTCGGTT TGACACCACG 
GCGGATCTCG TAATCCCCCC GTCCCTGATC GATCAGGTCA TCGGTCAGGA ACACGCAGTG
GATGTTATCA GAAAAGCCGC CACTCAGCGC AGGCACGTCA TGATGATCGG AAGTCCCGGT
ACCGGCAAGT CGATGCTTGC TAAGGCCATG TCCGAACTCC TCCCGAAAGA GGAAATGCAG
GACATCCTGA CGTACCCGAA TCCTGAGGAT AACAATAATC CGATCATCCG GGTGGTTCCT
GCAGGAAGAG GAAAAGAGAT CGTCGCGGCC CACAAAGAAG AGGCCCGCAA ACGTGCCTCC
TCGAAGAACA CGATGCTTCT CATCCTCGTC ATTGGTATTC TTGGGATCGC TCTGATTTCA
GGCCAGCTTC TGATGGGTAT CGTCGCCGTG GCCTTTATCT TCATGGCATT TCGTTCATTC
ATGCCCAAAG AAACTGCGAT GGTTCCAAAA CTCATCGTCT CCAACAAACC AGACTCGACC
GCACCTTTCG TGGACGGGAC CGGATCCCAC GCCGGAGCAC TGCTCGGAGA CGTTCGCCAC
GACCCGTTCC AGTCGGGAGG TCTCGAGACC CCTGCCCACG ACCGTGTAGA GGCCGGAGCA
ATTCACCGGG CACACAAGGG TGTTTTGTTC ATTGATGAAA TGAACACCCT TGAACTCTCT
TCCCAACAGA GTCTGTTGAC GGCACTTCAG GAAGGCGAGT TCCCGATCAC CGGTCAGTCC
GAGCGTTCAT CTGGAGCCAT GGTCAGAACC GAACCGGTTC CATGCCGGTT CCTGATGATC
GCAGCAGGAA ATCTCGATGC TGTCCAGCAT ATGCACCCTG CACTTAGGAG CCGTATTCGC
GGATACGGAT ACGAGGTCTA CATGAGCGAG ACCATGAATG ATACGCCTGA AAACCGGGCA
AAGCTCGTCA GGTTCGTAGC TCAGGAAGTT AAAAACGATG GTAAGATCCC GCATTACGAC
CCATCGGCGG TATCCGAGAT CCTTCGCGAG GCAAAACGCC GGTCCGGCAG AAAAGGCCAC
CTGACAGTGA AGCTGAGAGA TCTCGGCGGT CTTGTCCGTG TCGCCGGTGA TCTCGCCATT
CAGGAAGGGT CCCCGGTCAC ATCGGTCCAT CATGTTGTCT CGGCAAAACA GATCGCAAGA
TCCGTTGAGG ATCAGATATC CGACGAGTAC ATCCGTCGGA CCCGGGATTA TGATCTGACG
ATCGTTTCCG GCAATCTTGT CGGAAGGGTG AACGGACTTG CCGTTGTCGG CAACGATGCA
GGATCGGTCC TTCCGATCAC AGCCGAGGTT ACCCCCTCGC AGGGCGCAGG GATGGTTATT
GCGACCGGTC TTTTGAAAGA AATCGCCCAG GAGTCAATCA AGAACGTGAG CGCCTTAATC
AAGAAGTTCT CCGGAACTGA CATCCGCAAA GTCGATATCC ATGTCCAGTT CATTGGAACA
TACAATGGGG TAGAGGGAGA TTCGGCATCC GTGACCGTCG CGACGGCGGT AATTAGTGCG
CTTGAAGACA TTCCGGTCCG GCAGGATGTG GCGATGACCG GATCCCTGTC GGTAAGAGGA
GATGTTCTTC CGATCGGCGG TGTCACCTAC AAGATCGAGG CAGCCGCAAA AGCCGGGATC
CGCACGATCA TTATCCCGCA GTCGAACCTT GCCGATGTTC TTATCGAAGA GCGGTATTCC
GATATGGTTT CCATCATTCC GGTGACCAGA ATCGAAGAAG TGCTCAGATA TGCACTCGTT
CCCGAGGACA AGGAGGCATT CGAACAGAAA CTCCTCCAGA TCGGCAAACA TATGGATATC
CCGAAAATGC CGATGCCCGC CGACAACGTT GCAGCGTGA
 
Protein sequence
MYASDGYDAD LFGGVRFDTT ADLVIPPSLI DQVIGQEHAV DVIRKAATQR RHVMMIGSPG 
TGKSMLAKAM SELLPKEEMQ DILTYPNPED NNNPIIRVVP AGRGKEIVAA HKEEARKRAS
SKNTMLLILV IGILGIALIS GQLLMGIVAV AFIFMAFRSF MPKETAMVPK LIVSNKPDST
APFVDGTGSH AGALLGDVRH DPFQSGGLET PAHDRVEAGA IHRAHKGVLF IDEMNTLELS
SQQSLLTALQ EGEFPITGQS ERSSGAMVRT EPVPCRFLMI AAGNLDAVQH MHPALRSRIR
GYGYEVYMSE TMNDTPENRA KLVRFVAQEV KNDGKIPHYD PSAVSEILRE AKRRSGRKGH
LTVKLRDLGG LVRVAGDLAI QEGSPVTSVH HVVSAKQIAR SVEDQISDEY IRRTRDYDLT
IVSGNLVGRV NGLAVVGNDA GSVLPITAEV TPSQGAGMVI ATGLLKEIAQ ESIKNVSALI
KKFSGTDIRK VDIHVQFIGT YNGVEGDSAS VTVATAVISA LEDIPVRQDV AMTGSLSVRG
DVLPIGGVTY KIEAAAKAGI RTIIIPQSNL ADVLIEERYS DMVSIIPVTR IEEVLRYALV
PEDKEAFEQK LLQIGKHMDI PKMPMPADNV AA