Gene Information |
Locus tag | Msed_1419 |
Symbol | |
ID | 5104790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1385892 |
End bp | 1387136 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507308 |
Product | nucleotidyl transferase |
Protein accession | YP_001191501 |
Protein GI | 146304185 |
COG category | [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | |
|
|
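The length fields above follow from the coordinates by simple arithmetic. A minimal sketch of that check is given here, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length but is not stated in the record). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1385892
end_bp = 1387136

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is read as consecutive triplets.
# The last triplet is the stop codon, which does not encode an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1245 0 414
```

The results match the Gene Length (1245 bp) and Protein Length (414 aa) rows, and the zero remainder confirms the span is a whole number of codons.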
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0198372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00414302 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAATGGT CTCCTGAGGA TATAAAGGTT ATCATTCCCA TTGGGGGAGA AGCAACGAGA ATGCGCCCTT TAACCGTGGA AACCTCGAAA GCGACCGTGA GGCTTCTGAA TAGACCCCTC CTCGAGTTTC CGATTCTCGA GCTCGCAAAG CAGGGAGTAA AGGAGTTCAT TTTTGGCGTT AAGGGGTACG TTAATTACAA GTCTCTCTTT GACACCTTCA AGGAGGGTAT AGGATTTTCA GCTAGGTACA GGATTAAACC AAGGGTGCAC TTCAAGTATC AACCTAGAGT CGACAGCGTT GGTAACGCGG ACTCCGTAAG GATCAACATG GACTATTACA GGATAGATGA CATCACGCTC GTGATCCAGG GAGATAACCT GATCAAGCTG GACCTAAAGA AGCTAGTGGA CTATCACCTG TCAAAGGGAG CGATAATGAC TATCGTGCTC AAGAAGTGGC ACGACGTGAG GGAATTCGGG GTTGCGGACC TTGGGGAAGA CATGAAAATA AGGAAGTTCG TTGAGAAACC CAAGGAAGGA GAGGCTCCAT CTAACCTGAT CAACACGGGA GTCTACGTCT TGTCTCCTAA GATAAGGGAT ATCTTCGCGA GTGATGAGGT TTCGGCCATG AGAGAGGAGG GAAAGATGGA CTTTGGGAAG GACATAATTC CCTGCCTAAT CCAGAAAGGC TACCCAGTTT ACGGTTACGT TACTGACTCT CTCTGGTTTG ACGTCGGGAC TCCTGAGAGG TACTTGGAAG CCATGAGGGT ACTTCTAGAA AGCCTGGACG AACATGAAAT GGGTGGGAAG AGGATAGACC AGTCCAAGAG GATATTTGTC CAGGGAACGA GCCCTGACTC CATAAGGAGG AGAAACGTGA TAGCCATGAA GTATAGAAAG GGTAGGATGA AAATAGAAGG GAGCGTGCTC ATAGGAAGGC ACTGCCAGAT CGGGAACAAC GTTTACCTAG AGAACTCCAC GATTGACAAC TTCTCGATCC TAAGAAATAA CGTCAGGGTT GTGAGGAGCT CCATCATGGA CAGGGCATTC ATTGGCGAAG GAGTAGTGAT AGAGAACTCG GTCATAGCTA GACATGTGGA AATTAGGGGA GGGGCTAGGA TAATTGGGAG TGTCATAGGT GACGATGTGG TGATAGATGC TGACACTGAG ATAGTGAACT CGAAGATATA TCCGCACAAA GTTATTAACG CGAATAGTAA AATACACGAT ACTGTACTGA CTTAA
|
Protein sequence | MQWSPEDIKV IIPIGGEATR MRPLTVETSK ATVRLLNRPL LEFPILELAK QGVKEFIFGV KGYVNYKSLF DTFKEGIGFS ARYRIKPRVH FKYQPRVDSV GNADSVRINM DYYRIDDITL VIQGDNLIKL DLKKLVDYHL SKGAIMTIVL KKWHDVREFG VADLGEDMKI RKFVEKPKEG EAPSNLINTG VYVLSPKIRD IFASDEVSAM REEGKMDFGK DIIPCLIQKG YPVYGYVTDS LWFDVGTPER YLEAMRVLLE SLDEHEMGGK RIDQSKRIFV QGTSPDSIRR RNVIAMKYRK GRMKIEGSVL IGRHCQIGNN VYLENSTIDN FSILRNNVRV VRSSIMDRAF IGEGVVIENS VIARHVEIRG GARIIGSVIG DDVVIDADTE IVNSKIYPHK VINANSKIHD TVLT
|
| |
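The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11. The sketch below is hedged accordingly: it uses only the first 30 and last 15 nucleotides copied from the record rather than the full sequence, and the helper names (`CODON_TABLE`, `translate`) are hypothetical. Table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons, so the standard amino-acid string is used.

```python
from itertools import product

# Amino acids for translation table 11, in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(blocks: str) -> str:
    """Translate a space-separated nucleotide string; '*' marks a stop codon."""
    seq = blocks.replace(" ", "").upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))

# Fragments copied from the Gene sequence row (10-base blocks as displayed).
head = "ATGCAATGGT CTCCTGAGGA TATAAAGGTT"
tail = "ACTGTACTGA CTTAA"

print(translate(head))  # prints: MQWSPEDIKV
print(translate(tail))  # prints: TVLT*
```

The two outputs match the first ten and last four residues of the Protein sequence row, with the trailing `*` corresponding to the TAA stop codon. Pasting the full gene sequence into `translate` would extend the same check to the whole protein.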