Gene Mlab_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0540 
Symbol 
ID4796139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp513545 
End bp515473 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content58% 
IMG OID640099198 
Producthypothetical protein 
Protein accessionYP_001029981 
Protein GI124485365 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0523546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.309902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA TACAAGGGAT GAGCGGAGCG GACGTAAAAG CGATGACAGC GGAGCTTGCC 
GCGCTTTTGC CGCTCTGGAT CGGGAAAATA TATCAGTACG ACAATGCCTC GCTTGGGTTC
CGGCTGAATG GTGAGGAGAA GGCACGCCAT CTTCTGTATG TGGTGAGAGG CATTCGGGCG
CATCTCGTAT CCGAACTGCC GCCGGCGCCG AAAAACCCGT CCGGGTTTTC GATGTATCTT
AGGAAATACA TCGAGGGGGG CAAGGTCCTC AATATCGAGC AGAAGGCGAT CGAGCGGGTC
ATCATAATAA CGATCGGCAA AGGGCCGTCG GAGTATAAGC TGATCATCGA ACTCTTCGAC
GAAGGAAACC TTATCCTGAC AGACGAAAAG TTCACGATCA TCAATGCCCT TGCCCAGCGG
CGGTTCCGGG ACCGGGATAT CGTCGGCGGC GCAGAGTATG CGATCGAAGC CGTCTGGCCC
GAGAGGCTGA CGTTCGAGGA GTTCAAAGAG AAGATCACCG CAGACGAGAA CGATATCGTG
CGGGCTCTCG CAACGAAACT GCAGCTCGGC GGCATCCCCT CGGAAGAGAT ATGTCAGCTT
GCGGGCGTCT CGAAGTCGAT GCCCTGTAAG TTCGCGACCG ATGTCCAGCT TCGCCCGGTG
TATGAGGCGA TGAAAAGCTG GATCGCCAAA CTGACCTATG CACGCGATCC GGTGATCGAT
GCGAAGGGAG CGTTCCCGTT CCCCTCACTG GTCCGCGAGC CAAAAGAGCA CTTCGCGACG
TTTTCGCAGG CGCTCGAGGC GTTCTATCCA AAACCGGTGG CCGAGAAGGT CATCGAACAG
AAGATCAAAC TCTCCAAAGA GGAGCGGATC CGAAAACAGC AGGAAGCAGC CGTCGTCAAC
TTCGACAAAA AGATCGCCGA GGCGACCGAG ATCTCGGAGA TCATCTACTC GCATTACGGC
GAGGTGCAGG AGACGATCGA CGTTCTTGCG GCCGCAAGCC AGAAGCTTTC CTGGCAGGAT
ATCGCGGCCG TGATCAAAAA GAGCGACCTG CCCGCCGCAA AACGGATCAT CTCGGTGGAC
CCGAAGAATG CGTCCGTCGT GATCGATCTG CAGGAAAAGC ACAAGGTCAC GATCTTCGTG
CACGAAAGTC TGGAGGCAAA CGTCGGCAGA TACTTCGCCG TCGTGAAAAA GTTCCGGGCG
AAAAAGGCGG GAGCGCTTCG GGCGATGGAA GCAGGCATCG TGCATGCGGA GAAGAAAAAG
GCGGCCGGAC CCGGCCGACT CAAGCCGAAA TGGTATCACC GTTTCCGGTG GATGGAGACC
TCCGACGGCG TGCTTGTGAT CGGCGGCCGA AATGCCGACC AGAACGAGGA GCTTGTCAAG
AAGTATATGG AAGGCAAAGA CACCTTCCTC CACGCAGATG TCTTCGGAGC GTCCGCGGTG
ATCGTGAAAG GCGTAACGGA ACGCATGGAT CAGGCGGTCC AGTTTGCCGC GTCCTACTCG
CGGGCATGGG CCGGCGGTGG GGCATCCGTC GATGTGATCG CAGCAAGCCC CAACCAGGTA
AGTAAGACGC CCGAGTCGGG AGAGTATGTG GCTCACGGTT CGTTCGTCAT CCGGGGCGAG
CGTAAGATCT ACAAGGACGT GCCGCTCGAG ATCGCGATCG GGGTCAGGAC CGAACCGGTC
CTTGCGGTGA TCGGCGGAAC GCCTTCCGCG ATCGAACCGC TGGCTTCGCA TTCCGTTCGT
CTGGTTCCGG GGACATTCGA AGGAAACGAC GTTGCTAAGA AAGTTCTTCG AAAACTCAAA
GAGGCGGTGC CTGAAAGCGA GCAGAAAGCG CTCAAAGCGA TCCTGAACAC CGAGGGCGTA
GCCGCGTTCG TGCCGCCGGG CGGGTCCGAT CTCAAAGAGC CGGCCCGAGA GGGAGCGGAC
CGCGAATGA
 
Protein sequence
MATIQGMSGA DVKAMTAELA ALLPLWIGKI YQYDNASLGF RLNGEEKARH LLYVVRGIRA 
HLVSELPPAP KNPSGFSMYL RKYIEGGKVL NIEQKAIERV IIITIGKGPS EYKLIIELFD
EGNLILTDEK FTIINALAQR RFRDRDIVGG AEYAIEAVWP ERLTFEEFKE KITADENDIV
RALATKLQLG GIPSEEICQL AGVSKSMPCK FATDVQLRPV YEAMKSWIAK LTYARDPVID
AKGAFPFPSL VREPKEHFAT FSQALEAFYP KPVAEKVIEQ KIKLSKEERI RKQQEAAVVN
FDKKIAEATE ISEIIYSHYG EVQETIDVLA AASQKLSWQD IAAVIKKSDL PAAKRIISVD
PKNASVVIDL QEKHKVTIFV HESLEANVGR YFAVVKKFRA KKAGALRAME AGIVHAEKKK
AAGPGRLKPK WYHRFRWMET SDGVLVIGGR NADQNEELVK KYMEGKDTFL HADVFGASAV
IVKGVTERMD QAVQFAASYS RAWAGGGASV DVIAASPNQV SKTPESGEYV AHGSFVIRGE
RKIYKDVPLE IAIGVRTEPV LAVIGGTPSA IEPLASHSVR LVPGTFEGND VAKKVLRKLK
EAVPESEQKA LKAILNTEGV AAFVPPGGSD LKEPAREGAD RE