Gene Information |
Locus tag | Mlab_0206 |
Symbol | |
ID | 4795877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | - |
Start bp | 196246 |
End bp | 197901 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640098852 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein |
Protein accession | YP_001029649 |
Protein GI | 124485033 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
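The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose last codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length_bp = gene_length(196246, 197901)
print(length_bp, protein_length(length_bp))  # 1656 551
```

Both results match the Gene Length (1656 bp) and Protein Length (551 aa) rows.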
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.609669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCAC AACTTGGCGG ACAGCCAATC TTAATCCTCA AAGAAGGAGG AAACCGTACT CGCGGACGGG ATGCACAGAG CATGAACATC GCAGCAGCAA AAGCCGTTGC CGGTGCCGTA AGATCCACCC TTGGTCCGAA AGGCATGGAC AAGATGCTCG TTGACACCAT TGGAGATGTC GTTATCACCA ATGACGGTGT GACCATCCTC AAAGAGATGG ACATTGAACA CCCGGCAGCA AAGATGATGG TCGAGATCGC AAAGACTCAG GATGACGAAG TCGGAGACGG AACCACCACC GCAGTCGTCA TTGCAGGCGA ACTCTTAAAG AAGTCCGAAG AACTTCTCGA GATGGACGTT CACCCGACCG TCATCACCCT TGGATACAGA CAGGCAGCCG AGAAAGCACA GGAACTTCTC CAGACCATCG CAATCGACGT CAAGGCAAAG GACACCGCGA TCCTTTCCAA GATCGCCGGA ACCGCAATGA CCGGTAAAAA CGCAGAAGCT TCCAAAGACA AACTCTGTGA CCTGATCGTT CGTGCAATCA CGCTCGTTGC AGATGCTGAT GGAACAGTTG ACACCGAGAA CGTCAAAGTC GAGAAACGTG TCGGCGGCTC AATCGAAGAG TCCGAGATCA TCGAAGGTAT GATCATCGAC AAGGAACGTG TCCACCCGGG CATGCCCAAA TCCGTCAAGA ACGCAAAGAT CCTTCTGCTC AACGCAGCAG TCGAATACAA GAAGACCGAA GTCGATGCAG AGATCTCCAT CACCTCCCCA GATCAGCTCC AGATGTTCCT CGATGAGGAA GAGCGCATGA TCAAAGGCAT CGTCGAGAAG ATCAAAGCAT CCGGCGCAAA CGTCCTCTTC TGTCAGAAAG GTATCGACGA CATTGCTCAG CACTACCTCT CCAAGGCAGG CATCTTCGCA ACCCGCCGTG TAAAGAAATC CGATATGGAG AAACTTGCAC GTGCAACCGG CGGAGCACTC ATCTCCTCCA TCGACGCCAT CTCTGCTGAC GAGCTTGGTG TTGCAGGAAT TGTTGAAGAA CGCAAAGTAG GCGGCGAAGA GATGATCTTC GTTGAGAAAT GCAAGAACCC CAAAGCAGTC TCAATCATCA TCAAAGGCGG AACAGACCAC GTTGTCGACG AACTCGGCCG TGCACTGGAA GATGCACTCC GTGTCGTTGC CTGTGTCGTT GAAGACAAGA AGGTCGTCGC CGGAGGAGGA GCACCGGAAG TTGAGCTTTC CCTCAGACTC CGCGAATACG CAGCAACCCA GGGCGGACGT ATCCAGCTCG CAATCGAAGC ATTCGCAGGC GCCCTTGAAG TTATCCCGAG AACCCTTGCA GAGAATGCAG GTCTCGACCC AATCGACAAG CTCGTAGAGC TCCGTGCAGC ACACGAGAAA GGCAAGAAGA CCTACGGTCT CGATGTCTTT GAAGGAAAGG CAGTCGACAT GTGGGAAGCA GGCGTTGTAG AGCCGCTCCG CGTAAAGACC CAGGCTATTT CCTCAGCAGC AGAAGCCGCA GTCATGATTC TCAGAATCGA TGATGTCATC GCATCCGCAA AGTCCGCAGG ACCATCACCC GAAGAGATGG CCGCAATGGG CGGAGGCATG GGCGGTATGG GCGGCATGCC CCCAGGAATG ATGTAA
|
Protein sequence | MSAQLGGQPI LILKEGGNRT RGRDAQSMNI AAAKAVAGAV RSTLGPKGMD KMLVDTIGDV VITNDGVTIL KEMDIEHPAA KMMVEIAKTQ DDEVGDGTTT AVVIAGELLK KSEELLEMDV HPTVITLGYR QAAEKAQELL QTIAIDVKAK DTAILSKIAG TAMTGKNAEA SKDKLCDLIV RAITLVADAD GTVDTENVKV EKRVGGSIEE SEIIEGMIID KERVHPGMPK SVKNAKILLL NAAVEYKKTE VDAEISITSP DQLQMFLDEE ERMIKGIVEK IKASGANVLF CQKGIDDIAQ HYLSKAGIFA TRRVKKSDME KLARATGGAL ISSIDAISAD ELGVAGIVEE RKVGGEEMIF VEKCKNPKAV SIIIKGGTDH VVDELGRALE DALRVVACVV EDKKVVAGGG APEVELSLRL REYAATQGGR IQLAIEAFAG ALEVIPRTLA ENAGLDPIDK LVELRAAHEK GKKTYGLDVF EGKAVDMWEA GVVEPLRVKT QAISSAAEAA VMILRIDDVI ASAKSAGPSP EEMAAMGGGM GGMGGMPPGM M
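The sequence rows can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, assuming the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that whitespace only groups bases in blocks of ten. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the standard codon table is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (standard code; '*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGTCAGCAC AACTTGGCGG ACAGCCAATC"))  # MSAQLGGQPI
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content row.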