Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1604 |
Symbol | |
ID | 5103968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1551886 |
End bp | 1552944 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507494 |
Product | nucleotidyl transferase |
Protein accession | YP_001191683 |
Protein GI | 146304367 |
COG category | [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0798438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.128956 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAC TGATACTCGC CGCAGGAAAA GGAGAAGGAC TAAGCCCCTA CAGTGATAAG GTGCAGAAGG AGACAATCCG AATTGTGGGT AAACCTGTAA TACAATACGT AATAGAAGGA CTTGCTTCAG TTGGAGTTAC AGAATTCGTA ATCGTCGTGA ACGAAAAGGA GGAGCAGGTA TTGCAGGCAG TGAAGGACTT GAACGTGAGG ATTGAGACAG TTAGGCAGAG GTCCCCTGGA ATCTCTGGGG CTGTACAGGA TGGGATGGAG TACATGGATG ACATGTTTGT TCTGGCCTTT GGAGATATAG TAACGCCCAA GGACTTTTAC AGGGAACTGA TTACCACGTA CCACAGATCT GGTTCCCCTG TATTTTCCAC AGTTCCCGTA TCAACGGGTC TAGACACCTA CGGGCTCGTG AAGGTTGATC AGGGACTGCG GGTTGTGCGC GAGGGATCCA CGCTTGCATT GGCAGGAGCC TACGTAATTC CCAAGAAACC GTTCAATGAC TTTCTCCAGT ACCTAGATGA AGTGGCCAGG GACGCTGACT ATTTTGTATG GACAGGATCG TGGGTAGACA TAGGATATCC GGAAGACTTA ATTCAAGCAG TTGAGGAGCT ACTGAAATCC GAGTCGTCAA GGATATCTAA CAAGGCTAGC ATAGCTAGCA CTGCAGTGAT AGGAAAGAGT GTAATAGTGG AGGATGGGGC CACGATCGAG GATTTTGCGA TAATAAAGGG TCCAGCATAC ATTGGAAGAA ATGCGTATGT GGGTTCGTTC TCCCTTGTTA GGGATTTCTC GTCGATTGAG GAGAGCGCAA TCATTGGAGC CTACTCTGAA ATAGCCCATT CATTGTTGGG CCCCTACTCT GTCGTAGGAT CTAAGTCGTA CATTACTCAC AGTATCATAG GAGACAGAAC TAGGGTGGGA GCATCTGTCA TAACAGCTAG CTATCCCGCA ACTGTGAAAA GGCAGGTCTC TGGAAAGTTT GGCGCTTTAA TCTCTCCAGA TGAGAGTATA CCGCACGGGG TAGTGATTGG GCCCTCCTAC AGGAAATAG
|
Protein sequence | MKALILAAGK GEGLSPYSDK VQKETIRIVG KPVIQYVIEG LASVGVTEFV IVVNEKEEQV LQAVKDLNVR IETVRQRSPG ISGAVQDGME YMDDMFVLAF GDIVTPKDFY RELITTYHRS GSPVFSTVPV STGLDTYGLV KVDQGLRVVR EGSTLALAGA YVIPKKPFND FLQYLDEVAR DADYFVWTGS WVDIGYPEDL IQAVEELLKS ESSRISNKAS IASTAVIGKS VIVEDGATIE DFAIIKGPAY IGRNAYVGSF SLVRDFSSIE ESAIIGAYSE IAHSLLGPYS VVGSKSYITH SIIGDRTRVG ASVITASYPA TVKRQVSGKF GALISPDESI PHGVVIGPSY RK
|
| |