Gene Nther_2156 details

Gene Information

Locus tag: Nther_2156
Symbol:
ID: 6316020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2281260
End bp: 2282396
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 40%
IMG OID: 642644543
Product: protein of unknown function DUF453
Protein accession: YP_001918310
Protein GI: 188586765
COG category: [S] Function unknown
COG ID: [COG2828] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
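
As a quick consistency check on the fields above, the following Python sketch recomputes the gene length from the start and end coordinates and the protein length from the gene length. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, although both agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2281260
end_bp = 2282396
reported_gene_length_bp = 1137
reported_protein_length_aa = 378

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which does not contribute a residue to the protein.
codon_count = span_bp // 3
expected_protein_length_aa = codon_count - 1

print(span_bp)                     # 1137
print(span_bp % 3)                 # 0
print(expected_protein_length_aa)  # 378
```

Both computed values match the Gene Length and Protein Length fields of the record.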


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.738715
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCAGG AAAAAGTCCC AACAACTATT ATGAGAGGCG GCACGAGTAA AGCCATTTTT 
CTTAAGGATA AAGATCTTCC GACAAATCAG GAAGAAAGAG ATAATTTGAT TCTAAGAATT
TTTGGAAGCC CCGATCCAAG GCAAATTGAT GGCCTTGGGG GGGCAGACCC TCTAACAAGT
AAACTTGCAA TTATTGGACC ACCCACTAGA GAAGATGCTG ATATTGATTA TACTTTTGGG
CAAGTTTCAT ATACTGCTGC CAAAATTGAT TATTCCGGTA ATTGCGGGAA TGTATCCTCA
GCTGTAGGAC CGTTTGCAAT AGATAAAGGT TATGTCCAAG CTGAAGAACC ATATACTACT
GTGAGAGTTC ATAACACAAA TACTAACAAG ATTTTGATTG AGAAAGTACC TGTTGTAGAT
GGCTTATCCA AAGTAATTGG TGATTATCAA ATTGATGGTG TTCCTGGGCA AGGTGCTCCA
ATATCTATAG ACTTTTCCGA TACTGCCGGT GCAAAAACAG GTGAATTACT TCCTACAGGC
GAGGAAGTGA ACAAAATTTC GACTGAATCT TGTGGAGAGA TTGAGGCATC TTTAGTTGAT
GCTGGTAACC CCATGGTTTT TGTTAGAGCT GAGGACTTGG GTTTACGAGG AAATGAAACC
CCCGAAGAAA TTGACAATAA TGAAGAAGCT TTAAAAACCC TTGAAGAAAT AAGGGGAAAA
GCAGCTGTAA TGATGGGAAT TGAAACTGAT TGGAAACAAG CTGAACAAAA TAATCCGGCT
TTTCCAATGG TAGCTTTTGT ATCACCAGCT TCAGATTCAC AACAGGAAAG TGGCATAGAT
TTCAATTCCC GTTTGATGTT TATGCAGGTT ATGCACAAAA CTTATGCTGG TACTGGCACC
ATCTGTACGG GTTCAGCAGC TATGATTAAA GGCACCATTG TTAATCAGGT TATGAGTTCC
AAAAAAGACC AAGATGAAGC AACTATTAAG ATCGGACATC CAGCAGGATT TATTGAAATT
GAAGTCCGGG TCGATGAGGA TGAACAAAAT GGAAATTGGA TATTGAACAA AGCTGCTATT
AATCGGACAG CTCGAAGAAT TATGGATGGG AACTGTTATA TACCAAAAGA AGGCTAG
 
Protein sequence
MDQEKVPTTI MRGGTSKAIF LKDKDLPTNQ EERDNLILRI FGSPDPRQID GLGGADPLTS 
KLAIIGPPTR EDADIDYTFG QVSYTAAKID YSGNCGNVSS AVGPFAIDKG YVQAEEPYTT
VRVHNTNTNK ILIEKVPVVD GLSKVIGDYQ IDGVPGQGAP ISIDFSDTAG AKTGELLPTG
EEVNKISTES CGEIEASLVD AGNPMVFVRA EDLGLRGNET PEEIDNNEEA LKTLEEIRGK
AAVMMGIETD WKQAEQNNPA FPMVAFVSPA SDSQQESGID FNSRLMFMQV MHKTYAGTGT
ICTGSAAMIK GTIVNQVMSS KKDQDEATIK IGHPAGFIEI EVRVDEDEQN GNWILNKAAI
NRTARRIMDG NCYIPKEG
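
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted initiation codons, and an initiation codon is translated as methionine. The Python sketch below illustrates this on the first 30 bases of the gene sequence. It is a simplified illustration, not the method used to produce the record: the function name `translate_cds` is hypothetical, and the code assumes that the first codon of the input is a valid initiator without checking it.

```python
# Standard codon table in NCBI order (bases T, C, A, G). Translation table 11
# uses the same amino-acid assignments and differs in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        # Simplifying assumption: the first codon is a valid initiator -> 'M'.
        residue = "M" if index == 0 else CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, grouped as in the record.
first_30_bases = "GTGGATCAGG AAAAAGTCCC AACAACTATT"
prefix = translate_cds(first_30_bases)
print(prefix)  # MDQEKVPTTI
```

The result matches the first ten residues of the protein sequence in the record.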