Gene Nther_2023 details

Gene Information

Locus tag: Nther_2023
Symbol:
ID: 6315835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2134530
End bp: 2135843
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 38%
IMG OID: 642644411
Product: protein of unknown function UPF0052 and CofD
Protein accession: YP_001918178
Protein GI: 188586633
COG category: [S] Function unknown
COG ID: [COG0391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01826] conserved hypothetical protein, cofD-related
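
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2134530
END_BP = 2135843


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1314 437
```

Both values match the Gene Length and Protein Length fields in the record.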


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATGTAT TGAGGTGGTT CTATCCCGGT CTAAAGGTTA AAAGGTGGAT CTTAGTTGTA 
CTACTGGGAG CTTTATTAAG TTTTTTTGGG ATTGTAGTTT TACTGGGACC GTCTAAAGTT
GCCTTACTGG GTAAACAAAG CTTTGTGACT CTAGGTGAAA TATTAGGTGC TTACTCACGA
TATAATGGGA TTATTTTTTT GATAATTGGA GGCTTTACTT GCTGGTACGG TTTTAAGAAA
ACTATGCATT CTGTTATGAC AGCTGCAGTA CCACAAGAGG AAGAAAAGTT AGTTAAACTG
TTAAACAGAA GGAGACAACA AAAAATGGGC CCAAAAATTG TGGCCTTAGG AGGTGGTACA
GGTTTACCTA ATTTATTAAG GGGGTTGAAA CCTTACACAC AAAATATAAC TGCTGTTGTT
ACAGTTTCTG ATGACGGCGG AAGCTCTGGA AAGCTTAGAG GGGAATTTGG AATGGTTCCC
CCCGGTGATG TCAGGAATTG CTTGTTAGCT CTAGCCGATA CTGAACCACT CATGGAAGAA
ATCTTTAATT ACAGATTTAA AACAGAGGGA GACTTGGAAG GGCATAATGT AGGGAACTTA
ATTATAGCCG CCTTAAATGA CAAGAAAGGT TTCAAAGATG CCTTGGCGTC TGTCAGTCGC
GTCTTAGCCG TTAAAGGTAA TGTATTACCA GTTACTGATC AGTCTTTGAC ATTAAAAGCT
AGATGTACTG ATGGAACTAC AGTAGTGGGA GAAAGCAGCA TATCAAATCA GAGCAAACAG
ATCGAACAAG TATATCTTGA TGAACAAGAT GTATCTCCCC TTTCTGAAGT GATTACGGCT
CTCGAAGAAG CTGATGCTAT AATACTTGGT CCTGGGAGTC TTTATACAAG TGTTATACCT
AACTTATTGG TTCCAGGAAT CCCAGAGGCA ATTAAAAATT CCCAAGCTGT GAAAATCTAT
GTATCCAATA TAATGACCCA ACCAGGGGAA ACTGATAATT ATCGGGCATC TGATCATTTG
AGGTCGATTA TACAACACAC AGAATATAAT TTAGTTGATA CTGTCTTAAT TAATGGTGAA
TTAGATGTAG ATTCCCAAAC CCTAGCAAAA TATAAGGAAG AACTACAGGA ACCCGTACAA
CCTGACATTG AAAATCTTAC CAATATGAAG GTAGATACAA TTATAAAAAA CTTTAATGGG
CGAAACTCCC TTATCAGGCA TGATTCGGAA TTACTTGGGG AAGTCATTAT TAAAGAAGTG
ATCAATAAGA AAAAAGAAGT GATTCGAAAA AAATTATTTG CCAGGAGGGA TTAA
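
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any length or GC calculation. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example runs only on the first line of the sequence, so its GC value is for that fragment and not for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here as a short fragment.
first_line = "GTGGATGTAT TGAGGTGGTT CTATCCCGGT CTAAAGGTTA AAAGGTGGAT CTTAGTTGTA"
fragment = clean_sequence(first_line)
print(len(fragment))  # 60
print(round(gc_percent(fragment), 1))
```

Pasting the full sequence into `clean_sequence` gives a string whose length and GC percentage can be compared with the Gene Length and GC content fields.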
 
Protein sequence
MDVLRWFYPG LKVKRWILVV LLGALLSFFG IVVLLGPSKV ALLGKQSFVT LGEILGAYSR 
YNGIIFLIIG GFTCWYGFKK TMHSVMTAAV PQEEEKLVKL LNRRRQQKMG PKIVALGGGT
GLPNLLRGLK PYTQNITAVV TVSDDGGSSG KLRGEFGMVP PGDVRNCLLA LADTEPLMEE
IFNYRFKTEG DLEGHNVGNL IIAALNDKKG FKDALASVSR VLAVKGNVLP VTDQSLTLKA
RCTDGTTVVG ESSISNQSKQ IEQVYLDEQD VSPLSEVITA LEEADAIILG PGSLYTSVIP
NLLVPGIPEA IKNSQAVKIY VSNIMTQPGE TDNYRASDHL RSIIQHTEYN LVDTVLINGE
LDVDSQTLAK YKEELQEPVQ PDIENLTNMK VDTIIKNFNG RNSLIRHDSE LLGEVIIKEV
INKKKEVIRK KLFARRD
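
The protein sequence follows from the gene sequence under translation table 11, where the GTG start codon is read as methionine at the initiation position. Below is a minimal sketch of that translation, assuming the sequence is given on the coding strand and in frame; `translate_cds` is a hypothetical helper, and the codon table is built from the compact NCBI amino-acid string for table 11 rather than taken from any library.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiation codons of translation table 11; all are read as Met at the start.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # alternative start codons still initiate with Met
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGATGTAT TGAGGTGGTT CTATCCCGGT"))  # MDVLRWFYPG
```

The output matches the first 10 residues of the protein sequence; the same function applied to the full gene sequence can be compared with the full 437-residue protein.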