Gene Mlab_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0206 
Symbol 
ID4795877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp196246 
End bp197901 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content54% 
IMG OID640098852 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein 
Protein accessionYP_001029649 
Protein GI124485033 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.609669 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCAC AACTTGGCGG ACAGCCAATC TTAATCCTCA AAGAAGGAGG AAACCGTACT 
CGCGGACGGG ATGCACAGAG CATGAACATC GCAGCAGCAA AAGCCGTTGC CGGTGCCGTA
AGATCCACCC TTGGTCCGAA AGGCATGGAC AAGATGCTCG TTGACACCAT TGGAGATGTC
GTTATCACCA ATGACGGTGT GACCATCCTC AAAGAGATGG ACATTGAACA CCCGGCAGCA
AAGATGATGG TCGAGATCGC AAAGACTCAG GATGACGAAG TCGGAGACGG AACCACCACC
GCAGTCGTCA TTGCAGGCGA ACTCTTAAAG AAGTCCGAAG AACTTCTCGA GATGGACGTT
CACCCGACCG TCATCACCCT TGGATACAGA CAGGCAGCCG AGAAAGCACA GGAACTTCTC
CAGACCATCG CAATCGACGT CAAGGCAAAG GACACCGCGA TCCTTTCCAA GATCGCCGGA
ACCGCAATGA CCGGTAAAAA CGCAGAAGCT TCCAAAGACA AACTCTGTGA CCTGATCGTT
CGTGCAATCA CGCTCGTTGC AGATGCTGAT GGAACAGTTG ACACCGAGAA CGTCAAAGTC
GAGAAACGTG TCGGCGGCTC AATCGAAGAG TCCGAGATCA TCGAAGGTAT GATCATCGAC
AAGGAACGTG TCCACCCGGG CATGCCCAAA TCCGTCAAGA ACGCAAAGAT CCTTCTGCTC
AACGCAGCAG TCGAATACAA GAAGACCGAA GTCGATGCAG AGATCTCCAT CACCTCCCCA
GATCAGCTCC AGATGTTCCT CGATGAGGAA GAGCGCATGA TCAAAGGCAT CGTCGAGAAG
ATCAAAGCAT CCGGCGCAAA CGTCCTCTTC TGTCAGAAAG GTATCGACGA CATTGCTCAG
CACTACCTCT CCAAGGCAGG CATCTTCGCA ACCCGCCGTG TAAAGAAATC CGATATGGAG
AAACTTGCAC GTGCAACCGG CGGAGCACTC ATCTCCTCCA TCGACGCCAT CTCTGCTGAC
GAGCTTGGTG TTGCAGGAAT TGTTGAAGAA CGCAAAGTAG GCGGCGAAGA GATGATCTTC
GTTGAGAAAT GCAAGAACCC CAAAGCAGTC TCAATCATCA TCAAAGGCGG AACAGACCAC
GTTGTCGACG AACTCGGCCG TGCACTGGAA GATGCACTCC GTGTCGTTGC CTGTGTCGTT
GAAGACAAGA AGGTCGTCGC CGGAGGAGGA GCACCGGAAG TTGAGCTTTC CCTCAGACTC
CGCGAATACG CAGCAACCCA GGGCGGACGT ATCCAGCTCG CAATCGAAGC ATTCGCAGGC
GCCCTTGAAG TTATCCCGAG AACCCTTGCA GAGAATGCAG GTCTCGACCC AATCGACAAG
CTCGTAGAGC TCCGTGCAGC ACACGAGAAA GGCAAGAAGA CCTACGGTCT CGATGTCTTT
GAAGGAAAGG CAGTCGACAT GTGGGAAGCA GGCGTTGTAG AGCCGCTCCG CGTAAAGACC
CAGGCTATTT CCTCAGCAGC AGAAGCCGCA GTCATGATTC TCAGAATCGA TGATGTCATC
GCATCCGCAA AGTCCGCAGG ACCATCACCC GAAGAGATGG CCGCAATGGG CGGAGGCATG
GGCGGTATGG GCGGCATGCC CCCAGGAATG ATGTAA
 
Protein sequence
MSAQLGGQPI LILKEGGNRT RGRDAQSMNI AAAKAVAGAV RSTLGPKGMD KMLVDTIGDV 
VITNDGVTIL KEMDIEHPAA KMMVEIAKTQ DDEVGDGTTT AVVIAGELLK KSEELLEMDV
HPTVITLGYR QAAEKAQELL QTIAIDVKAK DTAILSKIAG TAMTGKNAEA SKDKLCDLIV
RAITLVADAD GTVDTENVKV EKRVGGSIEE SEIIEGMIID KERVHPGMPK SVKNAKILLL
NAAVEYKKTE VDAEISITSP DQLQMFLDEE ERMIKGIVEK IKASGANVLF CQKGIDDIAQ
HYLSKAGIFA TRRVKKSDME KLARATGGAL ISSIDAISAD ELGVAGIVEE RKVGGEEMIF
VEKCKNPKAV SIIIKGGTDH VVDELGRALE DALRVVACVV EDKKVVAGGG APEVELSLRL
REYAATQGGR IQLAIEAFAG ALEVIPRTLA ENAGLDPIDK LVELRAAHEK GKKTYGLDVF
EGKAVDMWEA GVVEPLRVKT QAISSAAEAA VMILRIDDVI ASAKSAGPSP EEMAAMGGGM
GGMGGMPPGM M