Gene Information |
Locus tag | Nther_2140 |
Symbol | |
ID | 6314799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2262313 |
End bp | 2263332 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644527 |
Product | Protein of unknown function DUF1722 |
Protein accession | YP_001918294 |
Protein GI | 188586749 |
COG category | [S] Function unknown |
COG ID | [COG3272] Uncharacterized conserved protein |
TIGRFAM ID | |
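
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 2262313
END_BP = 2263332
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 1020 bp gene corresponds to 340 codons, of which the last is the stop codon, giving the listed 339 aa protein.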
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTCAAG ATAATAAGAA CAGTTCAACA AAAAAACAGC AACTGGAGTC TCACAAGAAA GTGACTCCGC GTGTGCTTAT CAGTAGATGT TTAGGATTTG CCCCTTGCAG ATTTGATGGA AGCATGTTCC GAGAGAATTT GATTGATCAA TTACAGGGCT TTGTTAAGTT CGTACCGGTA TGCCCTGAGG AAGAACTGGG TTTAGGAACA CCCCGTAAGA CATTGAGACT ATATGAATCA CGAGAGGGTC ATGGCTTGTA TCAACAAGAC ACGGGTTCAG ATTTGACCCA GGAGATGAAA GACTTCGCTT ATGATTATCT CTCTACTGTA GATAACTTAC AAGGAGCAAT TCTTAAAACT AGATCTCCAT CATGTGCTTT GAAAGATGCC AAAATTTATG CTGAGAAAAC CAGTAACATC ACCTCTAAAA GAGGGGCTGG ACTATTTAGT GAATGTTTAT TAGAACAATG GCCCAACCTT CCTGTTGAAG ATGAAGGCCG GTTGAAAAAT CGTATCATAA GAGAGAACTT CTTAACTAAA ATCTTTAGCC TGGCAAGATT TGCAGAGATT AAATCCAGTG AATTAGTCAA AGAATTAATT AAATTTCATG CTGACCATAA ATTTTTATTT ATGTCTTATA ACGAAAGTGT TAAAAATGAG TTAGGTCGTC TTTTGGCAAA TCAAGATAAC TATTCTACTA AGGAACTATT TTCAGAATAT GAAACTCTAC TCTATAAGAT GTTTCAGGGA GAAAATACTC CAGGTCGTAA AGTCAATGTA CTCATGCATA TAATGGGCTT TTTCAAGGAT GAAGCTAGTA GTGATGAAAA ATCATTTCTG CTCGATACTA TTGAAAAATA TCGTGAAAAT CAACTGCCAT TATCTGTCCC AATAAATATA TTTAGATCTT GGGCTATTAA ATACGAGCAA AGCTATCTCT TGAGTCAGTA TTTTTTTGCC CCTTTTCCAG AAGAATTGAT TAGTTTGGAG GACTCTGGCA AGTCTTCTAG ACCCAAGTAG
|
Protein sequence | MSQDNKNSST KKQQLESHKK VTPRVLISRC LGFAPCRFDG SMFRENLIDQ LQGFVKFVPV CPEEELGLGT PRKTLRLYES REGHGLYQQD TGSDLTQEMK DFAYDYLSTV DNLQGAILKT RSPSCALKDA KIYAEKTSNI TSKRGAGLFS ECLLEQWPNL PVEDEGRLKN RIIRENFLTK IFSLARFAEI KSSELVKELI KFHADHKFLF MSYNESVKNE LGRLLANQDN YSTKELFSEY ETLLYKMFQG ENTPGRKVNV LMHIMGFFKD EASSDEKSFL LDTIEKYREN QLPLSVPINI FRSWAIKYEQ SYLLSQYFFA PFPEELISLE DSGKSSRPK
|
| |