Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1571 |
Symbol | |
ID | 6314494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1651871 |
End bp | 1652899 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642643943 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001917734 |
Protein GI | 188586189 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 1.79772e-16 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATATCA AGTTTAGTGT GATAATACCT ACCCATAATA GGTCCGAACA ATTATCGCTA ACTTTAACCT CTTTTAATAT ACAAACTATG AATAACTTTG AAGTCGTTGT GGTTGATGAT GGTTCAACCG ATAATACAAA AGACCTAGTT GAAAACTTTG AGGCATCCTA TCCATTGACT TATAAATTGA TTGAAAATGC TGGTAGTGCA GCAATGGCTA GAAATAAAGG CATCTCTTGG GCTAGCGGAA AATATCTCAT TTTTTGTGAT GCTGATTTTA TAGTAATTCC TGAATTTATA GAAATCTTTA ACAAATATAT TGATAAAAAT CCCGGGTCGG TAATTTCAGG TTTTCCTGAG TGTTGGAATA AAATTTATAC TTATTTTTAC CCCGACTTCA CTGCTAGACA GAAACAAAAG CTATATCACT CCCATAACTA TTCAGATAGA CAAAAAAATG AATTAGCGAA ATCGAAGAAG ATTGTTCAAA TTATCAAACC CCAAGATATT TACAACAATT TTAATAAAAT CAAACGCCTT GCATCACCAA ATTTAAAAAG AAATATCAAA AAACAGTTTC AAAAAACTGA TGTGGCTTCC TGGTTATTAT TTGTTACAAG ATGTGTTTTA GTTAACAAGA AATACGTTGA AGAATTAGGC GGATTTGACG CTAGTTTTCC TAAAAATGGA CTAGAAGACT GGGAGTTGGG TTATCGACTA AGTAATATTG GAATAGACTT TATTAGCATT CCTAAAGTTT TAGGTTATCA CCAAGAACAT CCACTAAGAG CTCGACAAAA TGAATTTCGC AACCTTTGTC TGATATACGA TAAGTACGAT TTTTCAATAC CTGAGTTTAA TCTGTTCGCC GTATATTGGC CTTGGAAAAA TATTAAAAAA TATAAAGATA GTCTCAGGCT TTTTAAAATA AAGCAAGATA ACAAAAGATT TCGTGAAGTC CGTCAGTTAC AAAATAAGTG GAAATACATG GCCATTAATT TTTATAATAA GTTTGCTGTA AAGGAGTAA
|
Protein sequence | MDIKFSVIIP THNRSEQLSL TLTSFNIQTM NNFEVVVVDD GSTDNTKDLV ENFEASYPLT YKLIENAGSA AMARNKGISW ASGKYLIFCD ADFIVIPEFI EIFNKYIDKN PGSVISGFPE CWNKIYTYFY PDFTARQKQK LYHSHNYSDR QKNELAKSKK IVQIIKPQDI YNNFNKIKRL ASPNLKRNIK KQFQKTDVAS WLLFVTRCVL VNKKYVEELG GFDASFPKNG LEDWELGYRL SNIGIDFISI PKVLGYHQEH PLRARQNEFR NLCLIYDKYD FSIPEFNLFA VYWPWKNIKK YKDSLRLFKI KQDNKRFREV RQLQNKWKYM AINFYNKFAV KE
|
| |