Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2078 |
Symbol | |
ID | 5105058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1996599 |
End bp | 1997540 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507968 |
Product | bifunctional phosphoglucose/phosphomannose isomerase |
Protein accession | YP_001192142 |
Protein GI | 146304826 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase |
TIGRFAM ID | [TIGR02128] bifunctional phosphoglucose/phosphomannose isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.810924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAATG TTTTCCTAGA CTGGGATAAA CTTTTTCGAG AAGCTGAAAG GATAGAGGTT CCGGACCTCA AGTACGATAA CGTGATTTAC ACAGGGATGG GGGCAAGTTA CATCCCGGGT GAAATGGCAA GGATACTTGA ACCTCCATTG GACTACCTCG TGTATAATGG GGATCCCACA AGATTCAAGG CTAGGGGAAA GTTCTCTTTG CTAGCCTTTA GCAGGTCCGG GGACAATGTG GAAACCTTGA TCGTAACCAG AAGGGCCTTT GAGCTTGGGG CGGACGTCAT ATGCGTTTCG GCTGGAGGAA AGCTGGCAAA CCTGTGCAGA GAGAAGGGCG GAAGACACGT TAACTTGTCC ATGCAGGCCA GGATGAGCAA CGGTCAAGAT TACCCCACGA GAGTCTGGTT CCCCCTCCTT TTCACGGCCC TCGTTAAGAT TCTGAACACG AGGAGTAGCG GGCAATACAG GATTTCGGAG CTGGCAGAGG GAGTGGAGGA AGGTAAGGAG AGGGCTCTTA ACCTTGCTAA GAGGTTGGTG GCCAAGATTA GGGGAAGGAT CCCGGTCTTT TACGGCTCCC TATACTTTCC CGTAGCAATA AGGTTCAAGC AGGATTTGAA TGAGACTGCC AAATATCCAG CCTTCTACGG GCCCATTCCT GAATCGAATC ACAATGACCT AGAGGCATAC GTCAGGGCAC AGAGCCTTCA GCCCTTTGTG ATTGGGGATC AGGACATTGA TTACGTAACG CTTTCCGTGA TTAAGGCTGA ACAGATAATT CCTGCAGGGA GCACACCGCT GAAGAACGTG GCCTACTTGG TTCTTCTCTC AGGTCTCACG TCCCTCCTGC TTGCAGAGGA AGAGGGATTA ACGGAAGAAG AGGCCTTCAG CGACAGCAAC CTTAAAATTG CAAGGAAACT GGCAAACTTA ATCCTGAAGT GA
|
Protein sequence | MHNVFLDWDK LFREAERIEV PDLKYDNVIY TGMGASYIPG EMARILEPPL DYLVYNGDPT RFKARGKFSL LAFSRSGDNV ETLIVTRRAF ELGADVICVS AGGKLANLCR EKGGRHVNLS MQARMSNGQD YPTRVWFPLL FTALVKILNT RSSGQYRISE LAEGVEEGKE RALNLAKRLV AKIRGRIPVF YGSLYFPVAI RFKQDLNETA KYPAFYGPIP ESNHNDLEAY VRAQSLQPFV IGDQDIDYVT LSVIKAEQII PAGSTPLKNV AYLVLLSGLT SLLLAEEEGL TEEEAFSDSN LKIARKLANL ILK
|
| |