Gene Information |
Locus tag | Nther_2021 |
Symbol | |
ID | 6315876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2132149 |
End bp | 2133114 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642644409 |
Product | protein of unknown function DUF199 |
Protein accession | YP_001918176 |
Protein GI | 188586631 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
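The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and not part of any database tool.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive
    - the CDS ends with one stop codon, which is not translated
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp / End bp rows of this record.
print(expected_lengths(2132149, 2133114))  # (966, 321)
```

Under these assumptions the result agrees with the listed Gene Length (966 bp) and Protein Length (321 aa).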
| |
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTTG CTAAATCATG CAAAAATGAA CTTTCAAGAA TTGAAATTAA TAGAGAATGC TGCGAAAGAG CTGAACTAGC TGCTTTTATT CATATGAATG GTTCTTTAAC AATAAAAGGA GACGTTACCC TTCATTTAAC AACAGAAAAT CCAGCTATTG CTAGGCGTAT ATTCCGAGTT TTCAAGAGTA GATTTAAAAA AGAAATGCAG ATATTAATGA GAAAAAAAAT GCGTTTGCAG AAAGGTAATA GTTACTCACT TATATTAACT GGAAAGAATA CAGTCAGCCT TGTCCTATCT AATTTGGAAA TTACCAAGGG AAGTTTTGAT TTAAATACCG GAATAACTCC AGAACTAGTA GCTAATAGAT GCTGTAAGAG GGCTTATTTA AGAGGAGCTT TTATGGCACG GGGTTCTATT GCGAACCCCG ATGCCAGTTA TCATATGGAG ATGACTGCTG ATTACGAAGA GTATTTGGAT GATCTCATTA AAGTAATGCA GTATTTTGAG CTATCCCCAG GTAAACTTGC AAGAAAAAAG GAGTACGTGA GTTATTTAAA GGATAGTGAG CAGATATGTG AGTTTCTGAA TATTATTGGA GCTCACAAAA CCCTCCTTGA TTACGAAAAC GTGAGGGTTA TGAAAGGTAT GAGAAATAAG ATAAATCGTT TGGTGAACTG TGAAACAGCA AATCTTCAAA AAACTGTTGT AGCCTCTTTA AGGCATATAA AAAATATACA AACAATAGAT GAAAATCTTG GATTGACACA ACTTCCCAAA TCTCTACAAG AAGTGGCAAT TAAAAGAGTT GAATACCCAG AAGCTAATTT AAAAGAATTA GGAGAGCTTT TAGAACCTCC AGTGGGCAAA TCCGGGGTTA ATCATCGCCT ACGGAAACTA GAAAAAATTG CTGAACAGTT GCATCAAACT GGATATTACG ATGAAAATAA TGGATATTTA CAATAA
Protein sequence | MSFAKSCKNE LSRIEINREC CERAELAAFI HMNGSLTIKG DVTLHLTTEN PAIARRIFRV FKSRFKKEMQ ILMRKKMRLQ KGNSYSLILT GKNTVSLVLS NLEITKGSFD LNTGITPELV ANRCCKRAYL RGAFMARGSI ANPDASYHME MTADYEEYLD DLIKVMQYFE LSPGKLARKK EYVSYLKDSE QICEFLNIIG AHKTLLDYEN VRVMKGMRNK INRLVNCETA NLQKTVVASL RHIKNIQTID ENLGLTQLPK SLQEVAIKRV EYPEANLKEL GELLEPPVGK SGVNHRLRKL EKIAEQLHQT GYYDENNGYL Q
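The sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch for working with them is given below; it joins the blocks, computes GC content, and translates a coding sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares for amino acids (the tables differ in which start codons are permitted). The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand; the sketch assumes this and does no reverse-complementing.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGTCCTTTG CTAAATCATG CAAAAATGAA"
print(translate(first_blocks))  # MSFAKSCKNE
```

The translated prefix matches the first block of the listed protein sequence. Passing the full Gene sequence row to `gc_percent` and `translate` gives values that can be compared with the GC content and Protein sequence rows.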