Gene Information |
Locus tag | Mbar_A0597 |
Symbol | |
ID | 3628003 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 718417 |
End bp | 719703 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637699490 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_304157 |
Protein GI | 73668142 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
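The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Mbar_A0597.
length = gene_length(718417, 719703)
print(length)                  # 1287, matching "Gene Length"
print(protein_length(length))  # 428, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.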
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCG TGGAAGATGC ACAAAAAGGG ATTATTACTG AAGAAATGAA GATTGTTGCA AAGGACGAAG GACTTGACCC TGAATTCATC CGTCGTGGTG TTGCAGCCGG AAGAATTGTT ATTCCAACCT CCCCATACAG GCAGGTAAAG ATCTGCGGTA TAGGAGAAGG GCTCAGGACC AAAGTCAATG CATCCATCGG TGTATCCTCG GATATTGTTG ATGCAGACAT GGAAGTTAAA AAAGCACAGG CTGCCGAAGC TGCAGGTGCA GACACCCTTA TGGAGCTCGG AACTGGTGGA GACTTCCTTG CAATCAGGAA AAAAGTCATT GACAGTATTT CCCTTTCAGT CGGTTCAGTG CCTCTTTACC AGGCCTTCAT TGAGGCCGCA AGGAAATACG GCTCAATCGT GGATATGACC GAAGACGAAC TCTTCAAGGC AACCGAAGAC CAGGCAAAGC TCGGAACTAA TTTCATGGCA ATTCACACAG GAATCAACAA TATCACCATG GACCGCCTTA AAGCCCATGG CAGGTACGGT GGCCTCTGTT CCCGTGGTGG CGCCTTTATG ACTTCCTGGA TGCTCCACAA TGAAAAGGAA AATCCACTTT ATGCAAACTT TGATTACCTT GTTGAGATCC TCAAGGAACA CGAAGTAGTC CTCTCTACCG GAAACGGTAT GCGTGCAGGT GCAGTCCACG ATGCAACCGA CCGTGCCCAG ATCCAGGAAT TAATTATTAA CTCCGAACTG GCCGACAGAG CCCACAAGCA GGGTGTGCAG GTCATTGTCG AAGGTCCGGG TCATGTCCCT CTCGACCAGA TAGGAACCAA CGTAAAACTC ATGAAGGAAA TGAGCGGTCA CAAGCCATTC TACATGCTCG GCCCACTTGT AACTGACATC GCACCAGGTT ACGACCACAT CGTAACTGCA ATCGGAGCAT CGGTTTCTGC TTCATATGGC TGTGACTTCC TTTGCTATGT AACTCCTGCA GAGCACCTTG CCCTTCCAAA CCTTGAAGAT GTTATCACAG GAGTCAAAAC CTCAAAGATT GCAGCTCACG TAGGCGATAT GGTAAAATAT CCAGACAGGG CAAGAGAACA GGACCTTGCT ATGGGCAGAG CTAGAAGAGA CCTCGATTGG CAAAAGATGT ACTCTCTTGC AATCGACCCA GAACACGCAA AAGAAGTTAG GAACAGCAGG GCTCCCGAAG ATTCTGACGC CTGCACAATG TGCGGTAACT TCTGCGCCCT CAAGATCGTA AACCAGAACT ACAACCTCGC AAAATAA
|
Protein sequence | MTIVEDAQKG IITEEMKIVA KDEGLDPEFI RRGVAAGRIV IPTSPYRQVK ICGIGEGLRT KVNASIGVSS DIVDADMEVK KAQAAEAAGA DTLMELGTGG DFLAIRKKVI DSISLSVGSV PLYQAFIEAA RKYGSIVDMT EDELFKATED QAKLGTNFMA IHTGINNITM DRLKAHGRYG GLCSRGGAFM TSWMLHNEKE NPLYANFDYL VEILKEHEVV LSTGNGMRAG AVHDATDRAQ IQELIINSEL ADRAHKQGVQ VIVEGPGHVP LDQIGTNVKL MKEMSGHKPF YMLGPLVTDI APGYDHIVTA IGASVSASYG CDFLCYVTPA EHLALPNLED VITGVKTSKI AAHVGDMVKY PDRAREQDLA MGRARRDLDW QKMYSLAIDP EHAKEVRNSR APEDSDACTM CGNFCALKIV NQNYNLAK
|
| |
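The gene and protein sequences above are displayed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC percentage calculation; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = "ATGACGATCG TGGAAGATGC"
seq = clean_sequence(example)
print(len(seq))         # 20
print(gc_percent(seq))  # 50.0
```

Applying the same two functions to the complete Gene sequence field would allow its length and GC content to be compared against the Gene Length and GC content fields of the record.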