Gene Nther_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2021 
Symbol 
ID6315876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2132149 
End bp2133114 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content35% 
IMG OID642644409 
Productprotein of unknown function DUF199 
Protein accessionYP_001918176 
Protein GI188586631 
COG category[S] Function unknown 
COG ID[COG1481] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00647] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTTG CTAAATCATG CAAAAATGAA CTTTCAAGAA TTGAAATTAA TAGAGAATGC 
TGCGAAAGAG CTGAACTAGC TGCTTTTATT CATATGAATG GTTCTTTAAC AATAAAAGGA
GACGTTACCC TTCATTTAAC AACAGAAAAT CCAGCTATTG CTAGGCGTAT ATTCCGAGTT
TTCAAGAGTA GATTTAAAAA AGAAATGCAG ATATTAATGA GAAAAAAAAT GCGTTTGCAG
AAAGGTAATA GTTACTCACT TATATTAACT GGAAAGAATA CAGTCAGCCT TGTCCTATCT
AATTTGGAAA TTACCAAGGG AAGTTTTGAT TTAAATACCG GAATAACTCC AGAACTAGTA
GCTAATAGAT GCTGTAAGAG GGCTTATTTA AGAGGAGCTT TTATGGCACG GGGTTCTATT
GCGAACCCCG ATGCCAGTTA TCATATGGAG ATGACTGCTG ATTACGAAGA GTATTTGGAT
GATCTCATTA AAGTAATGCA GTATTTTGAG CTATCCCCAG GTAAACTTGC AAGAAAAAAG
GAGTACGTGA GTTATTTAAA GGATAGTGAG CAGATATGTG AGTTTCTGAA TATTATTGGA
GCTCACAAAA CCCTCCTTGA TTACGAAAAC GTGAGGGTTA TGAAAGGTAT GAGAAATAAG
ATAAATCGTT TGGTGAACTG TGAAACAGCA AATCTTCAAA AAACTGTTGT AGCCTCTTTA
AGGCATATAA AAAATATACA AACAATAGAT GAAAATCTTG GATTGACACA ACTTCCCAAA
TCTCTACAAG AAGTGGCAAT TAAAAGAGTT GAATACCCAG AAGCTAATTT AAAAGAATTA
GGAGAGCTTT TAGAACCTCC AGTGGGCAAA TCCGGGGTTA ATCATCGCCT ACGGAAACTA
GAAAAAATTG CTGAACAGTT GCATCAAACT GGATATTACG ATGAAAATAA TGGATATTTA
CAATAA
 
Protein sequence
MSFAKSCKNE LSRIEINREC CERAELAAFI HMNGSLTIKG DVTLHLTTEN PAIARRIFRV 
FKSRFKKEMQ ILMRKKMRLQ KGNSYSLILT GKNTVSLVLS NLEITKGSFD LNTGITPELV
ANRCCKRAYL RGAFMARGSI ANPDASYHME MTADYEEYLD DLIKVMQYFE LSPGKLARKK
EYVSYLKDSE QICEFLNIIG AHKTLLDYEN VRVMKGMRNK INRLVNCETA NLQKTVVASL
RHIKNIQTID ENLGLTQLPK SLQEVAIKRV EYPEANLKEL GELLEPPVGK SGVNHRLRKL
EKIAEQLHQT GYYDENNGYL Q