Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2383 |
Symbol | |
ID | 6314270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2545357 |
End bp | 2546553 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 642644771 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001918536 |
Protein GI | 188586991 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.501705 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAA TTCATTTAAT CAGCGGGAGC GAAGGAGGCG GTTCACGATT TCAGGTATTA AGCCTCCTGG AAGAACTTGT TCAAAATACA GATGATGACC ATGATATCGA ATTGGTATCT TTAATGGAAG GGCCTTTAAC TCAAGACGCA CGGGAACGTG GTCTTCCCAT CAAAGTTATT CCCATGAAAG GAATGTTGGA TTTTAGAGTT ATCTCTCCCT TGATTAGATA CCTGGCGGAA AGGAAGCCAG ATATACTCCA TACTCATGGG GTGAGAGCAA ACTTTATTGG CAGATTGACT TACAAATTTA TGTTATCTGA TCTAAAACCC GTAATGTTTA CTACAGTTCA TTCGTCTATT TATCATGATT ACACAAACAG CTGGAAAAGA TTTATTTATC CTTTTATGGA AAAGAGCTTG CGAGCAGTTG TTCATCGTTT TATAGCAGTA TCTGATGGTT TGTATCAGGA GCTTTTGCAG GACGGTATTG AAGAAGACAG ATTGGCTTTG GTGCCAAATG GGATCTACAC AGAGAAATTT TCACCTGATA CTAACTCCGA CAGTGACCCT ACTACCTTGA AGGAAGAACT GGGGATTCCG GAAGAGGCGA CGGTAATTTT GACTGTAGGT CGGCTTGTTC CGGTTAAGGG TCAAGATTAT CTCCTGGAAG CTTTTAAAGA CTTATTGGAA GATTTGACAG AAGAAGAGGA TATTGGTACT TACTCACAAG AAAAATTGCC TTATCTTGTC ATCGTAGGAG ATGGTCCCCT AGGTGATAGC TTATCTTCTA AGGCCAAGAG CTTGGGGATT GAAGAAAAAG TTATCTTTAC TGGTTTTCGG CGGGATATTC CTGCTTTCTT CCAAATGGCT GATATATTTA CCCTGCCATC TTTAATGGAG GGTATGCCTA TCATTTTATT GGAAGCTATG GCAGCAAGGT TACCATTGGT GGCTAGTCGG GTAGGGGGAG TATCTGAAGT GGTTAATGAA GGCGAAACTG GCCTCATGGT ACCATCAAAA GACCCGAAAA CACTAGCGGA AGCTTTAAAG AGACTGTGGC AATCCCCGGA TTTATGCCGT AAATTGGGTG GCCAAGCTGG TGAGAGAGTT GAGCGAGACC ATCATTTTTC TCGGGTTGTC ACTGAAACTT TGAAATTATA TCAAAAGACT GTTAACTCAA AAAGGGAACC CGGATGA
|
Protein sequence | MKVIHLISGS EGGGSRFQVL SLLEELVQNT DDDHDIELVS LMEGPLTQDA RERGLPIKVI PMKGMLDFRV ISPLIRYLAE RKPDILHTHG VRANFIGRLT YKFMLSDLKP VMFTTVHSSI YHDYTNSWKR FIYPFMEKSL RAVVHRFIAV SDGLYQELLQ DGIEEDRLAL VPNGIYTEKF SPDTNSDSDP TTLKEELGIP EEATVILTVG RLVPVKGQDY LLEAFKDLLE DLTEEEDIGT YSQEKLPYLV IVGDGPLGDS LSSKAKSLGI EEKVIFTGFR RDIPAFFQMA DIFTLPSLME GMPIILLEAM AARLPLVASR VGGVSEVVNE GETGLMVPSK DPKTLAEALK RLWQSPDLCR KLGGQAGERV ERDHHFSRVV TETLKLYQKT VNSKREPG
|
| |