Gene Information |
Locus tag | Mpal_1977 |
Symbol | |
ID | 7270783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2103219 |
End bp | 2104493 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643570592 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002467003 |
Protein GI | 219852571 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
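
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon; both assumptions are inferred from the listed values rather than stated in the record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2103219
end_bp = 2104493

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1275 424
```

Under these assumptions the result matches the listed Gene Length (1275 bp) and Protein Length (424 aa).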
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.540426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGA TCAAGGATGC ACAGCGCGGC CTCGTCACCG AGGAGATGAA GCTGGTCGCT GCACAGGAGG GCGTAACCGA GGAGTTCGTC AGAAAAGGGG TCGCTGGAGG GCATATCGTT ATTCCGGTCT CCCCCTACCG AAAGGTTAAG ATCTGCGGGA TCGGCGAGGG GCTCCGCACC AAGGTGAACG CCTCGATCGG AACATCCTCC GATATCAGCG ACGTTTCGGT GGAGATCGAG AAGGTCAGGC AGGCAGAACT GGCAGGTGCC GATACGCTGA TGGAACTCTC CACCGGCGGG GACCTGGCCG ACATCAGGCG CCAGGTCATC GCAGCCACCT CCCTCTCGGT CGGTTCCGTG CCCCTGTACC AGGCTTTCAT CGAGGCTGCC CATAAGAAAG GCGCAGTTGT CGATATGGAG GCGGACGACC TCTTCCGGAT CACTGCCGAA CAGGCAAAGG CAGGCACCAA CTTCATGGCG ATCCACACGG GGATCAATTA CGAGACGATG AAGCGGCTGC AGAACCAGGG CAGGCACGCC GGGCTCGTCT CCCGCGGCGG TGCGTTCATG ACTGCATGGA TGCTCCACAA CGAGAAGGAG AACCCACTCT ACGCTGAGTT CGACTACCTC CTCGAGATCA TGAAGGAGCA TGAAGTGACC CTCTCGATGG GGAACGGCAT GCGAGCCGGC GCCGTCCATG ACTCTACGGA CCGTGCAGCC ATCCAGGAAT TGCTGATCAA CGCAGAACTG GCAGACAAAG CCTTCAACGA AGGGGTTCAG ACGATCGTCG AGGGACCCGG GCATGTCCCG ATCGATGAGA TCCAGGCCAA TGTGATCCTG CAGAAGCGGG TCACGAACCG CAAGCCCTTC TACATGCTCG GCCCACTGGT CACCGACATC GCACCAGGCT ACGATGACCG GGTCGCCATG GTCGGTGCAG CTCTCTCCTC GTCGTACGGT GCAGACTTCA TCTGCTATGT GACGCCGGCG GAGCATCTGG CCCTTCCGAC ACCTGAAGAG GTCTTCGAGG GAGTCATCTC CTCCCGTATC GCAGCTCACA TCGGGGACAT GGTCAAGCTG AACAAGCGCG ACGATGATCT GGAGATGGGA CATGCGAGAA AAGCCCTCGA CTGGGACCGG CAGTACGCAG TTGCAATCAA CCCGAAGAGG GCGAAGGAGA TTCGGGACAG CCGGATGCCG GCCGATACCG ACGGGTGCAC GATGTGCGGC GACTACTGTG CCATCAAGAT CGTCGCCAAG CACTTCAACT TCTAA
Protein sequence | MSLIKDAQRG LVTEEMKLVA AQEGVTEEFV RKGVAGGHIV IPVSPYRKVK ICGIGEGLRT KVNASIGTSS DISDVSVEIE KVRQAELAGA DTLMELSTGG DLADIRRQVI AATSLSVGSV PLYQAFIEAA HKKGAVVDME ADDLFRITAE QAKAGTNFMA IHTGINYETM KRLQNQGRHA GLVSRGGAFM TAWMLHNEKE NPLYAEFDYL LEIMKEHEVT LSMGNGMRAG AVHDSTDRAA IQELLINAEL ADKAFNEGVQ TIVEGPGHVP IDEIQANVIL QKRVTNRKPF YMLGPLVTDI APGYDDRVAM VGAALSSSYG ADFICYVTPA EHLALPTPEE VFEGVISSRI AAHIGDMVKL NKRDDDLEMG HARKALDWDR QYAVAINPKR AKEIRDSRMP ADTDGCTMCG DYCAIKIVAK HFNF
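
The sequences above are printed in space-separated blocks of ten letters, so they need cleaning before any analysis. The following is a minimal sketch of that cleanup, with a codon split and a GC fraction helper. The function names `clean`, `codons`, and `gc_fraction` are hypothetical, and the two fragments are copied from the first and last blocks of the gene sequence rather than the full record.

```python
def clean(seq):
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def codons(seq):
    """Split a cleaned sequence into complete 3-letter codons, dropping any remainder."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return [s[i:i + 3] for i in range(0, usable, 3)]


def gc_fraction(seq):
    """Fraction of G and C letters in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# Fragments copied from the start and end of the gene sequence above.
head = "ATGAGTCTGA TCAAGGATGC"
tail = "CACTTCAACT TCTAA"

print(codons(head)[0])   # ATG (start codon)
print(codons(tail))      # ['CAC', 'TTC', 'AAC', 'TTC', 'TAA']
print(gc_fraction("ATGAGTCTGA"))  # 0.4 for this 10-letter block only
```

The last codon of the tail fragment is TAA, a stop codon, and the four codons before it correspond to the final residues HFNF of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.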