Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1045 |
Symbol | |
ID | 6314226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1110022 |
End bp | 1111092 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642643417 |
Product | hypothetical protein |
Protein accession | YP_001917217 |
Protein GI | 188585672 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0229946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCAGTAA GAAGGTCATT TCAATTATTA TTGATCACAT GCTTGTCTGT TTCAGTAACT TTTATGACAG GTTGCACACC AGAGTACGAA GTGGAGATAC AAGCTGATCC TGAAGAAGCA GGAGAAATTG AAGGTGAAGG CACTTACGAA GAAGGAGAAG AGGTTACAGT TGAAGCTGAA CCTAATGAAG GTTATGAATT TAAAAAATGG GAAAAAGAAG GTGAAGAAGT AAGTCAAGAT CAAGAATATA AATTTGAAAT TGAAGAAAGT ATAGAATTAG TTGCTGAATT TGCAGAAACT GTAAACATTC CTGATGAAAA CTTAGAAGCA GCTATTAAGG AAGAGTTAGG TGTAGATCAA GTGACAAAAG AAAATATTAA ACAGTTAACA TCTTTGGAGG CGAGAAGAGA AGGGATTAGC GATCTGATTA ACTTGGGAAA GGCAGAAAAT CTTAAAAACT TAAACCTTTC GGGCAATAAA ATTCAAGATA TAACCGCTTT AACTGAACTT ACGGGACTAG AGAAGTTAAA CTTAAACAAT AATGAGATAA CAGATATTAA AGCACTACAT GAATTGACTA ATCTTAAAGA AGTCAACCTT ATAGGAAATG AAATCGATGA AATAAACTTT TTAGGAGAAT TAAATGATCT CAAAAAACTT TCTGTAAGGG ATAATGAGAT GAATTTGACA TTGGTTGATT TTGACCAATC TCCTGGAGAT GACTATATAG CTTATAGGGT AAAAAGCCCA TTACCCTTAA CTGTCGATGT TCAGGTAGTA AACATAAATG AAGACTTAAG CGTTGAACAG GTATTTGAAC CAAATAGTGT TACTGATCGG GGCGAGCTTT ATAGTTTTAC TGAAGCGTCT TCCTATGAAG ACCCATTTGT TCAGCATGTT CCCAAAAAAC ATCCATTTGA CGTTTATAAA GACCATGAAT GGAAAAATAC ACAAGTTTTA AAGGTTGTTG TTCGTGAAGA ACATGAAGAG GGTAAGGATG GAGAGCACAA GTTACCTGGA GAGTATAAAA TCGATTTTAA AAACAGAGAG ATTATTGAAA TTAATAATTA A
|
Protein sequence | MSVRRSFQLL LITCLSVSVT FMTGCTPEYE VEIQADPEEA GEIEGEGTYE EGEEVTVEAE PNEGYEFKKW EKEGEEVSQD QEYKFEIEES IELVAEFAET VNIPDENLEA AIKEELGVDQ VTKENIKQLT SLEARREGIS DLINLGKAEN LKNLNLSGNK IQDITALTEL TGLEKLNLNN NEITDIKALH ELTNLKEVNL IGNEIDEINF LGELNDLKKL SVRDNEMNLT LVDFDQSPGD DYIAYRVKSP LPLTVDVQVV NINEDLSVEQ VFEPNSVTDR GELYSFTEAS SYEDPFVQHV PKKHPFDVYK DHEWKNTQVL KVVVREEHEE GKDGEHKLPG EYKIDFKNRE IIEINN
|
| |