Gene Nther_0571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0571 
Symbol 
ID6314713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp598571 
End bp600427 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content39% 
IMG OID642642954 
ProductHAD-superfamily hydrolase, subfamily IA, variant 1 
Protein accessionYP_001916754 
Protein GI188585209 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0546] Predicted phosphatases
[COG2006] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAGCA AAGTTTCTAT AGCCCAAGTT GGTTGCCAGT ATAGTGATGA CATTATCGCT 
GCTAAAATGA AGCAGCTATT TTTTGACACG GGTTTAAATG ATTATATAGC TCCAAATAAA
CAGGTTCTGA TTAAGCCCAA CCTAGTGGCG GTTCCCCCGG AAGATTTTCG AGGAGCCATT
ACTCATCCTT TAATAGTTCA AAAACTGTCA GATATAGTCC GATCCTTGGG AGGACAAGTT
ATTATTGGTG ACTCATCAGC AGTAGGGGTC AACACAGAAG ATGTTATCTC TACTACGGGA
TATGAGAAGC TAAGACAGCA GGGCTATCAA GTTATTGATT TAAAACAGGA TTCGGTAGTA
GACCTTCAAG TTCCAGGAGG TGGAAAAGCA CTGTCCCAGT TGCCTGTAGC CAAAACAGTC
AAAGACGTAG ATCTAATAAT TTCAGTACCA GTTATGAAAA CTCATGATCA GGTAGAAGTT
AGCCTGAGTA TCAAAAATCT AAAAGGACTA CTACCAGATA AAATAAAAAA AGCATTTCAT
AATAAATATG GCCTAGCTAA AGGAGTATCA GATATACTGG CAACTGTGCC TCCAGTGGTT
TCAGTCCTAG ATGCCACCTA TGCATTAGAG GGTATGGGGC CTGTCTATGG AGAATCTGTA
CCCATGGGAC TTATTTTAGC CAGCAGCGAT CCTGTGGCCT TAGATTCAAT AGCTGCAGGT
ATAATGGGTT TGGAGGAAGA TGAACTTAAG ATTGAGGGAG AATGTTATAA TAGGTGTTTA
GGTGAATTGA GACGAGATAA GATCACCATC TCAGGTGATG TGACAGATAT AGATCAAGTA
GCGAGAAGAT TTACTCGGAT CAAAGATTTG GATTATCAAT TTAATGTTGA CTTTGATTTA
ATATTTAATG AGGAAGTTTG TACAGGCTGT AAAAATACAG TTATGAGCTC CTTAGATGAT
ATCCAAACTC AGGGAGTGGA GCCATATTTA TCGGGAAAAA CTGTATATGC CGGTCCATTA
ACACAAGGTG AGATTAGTGG TAGCTCCAAC TCTATACTGA TCGGAAATTG CCTATACAAG
CATAAAAGTC AAGGAACTTT TGTGCCCGGA TGTCCGCCGG AAAATTTACC TGTAATTGAA
GGCCTTGTAG GTGAAGGAAA GATTGCCAGA CGTTATACTA GTGAAAATCA AAGTCAGTTT
AATCATCCTT GGGGCATTAT CTACGATTTA GATAATACAT TGATCAACTC AAAAATAAAT
TTTAATAAAA TGAAAGTAGA AGTCATGAAT TATCTCCAAG AAGAACAGCT TTTGCCCGAA
ATTACTAACC TTGAGAAACA TACTGCTGCT ACCTTGATTC AAACTGCTCG TCAACATTCC
ACTTTGGATC AGGAACAAGA AGATGGATTG TGGGCTTTAA TTACTGCCAT TGAAGCTGAA
GGCATGGATA AGGCCGAAAC AGAACCTGAT ATCAATGAAG TGATTTCAAC TTTAGCTAAT
GAGTATACTC TGATCGTATT GACTAATAAC TCATATAAAG CTGCTATGAA AGCTTTGAAA
CAATTTGGCT TAGATGAATA CTTTCAATTG GTGGTAGGTA GGGAGCAGAT GACTAGTTTG
AAACCATCTC CTTCTGGTGC GGAATACATC TTAGAGCATT TTGGTGATAC TAAAGCTGAA
GATTGGGTCA TGGTAGGAGA TTCTTGGATA GACGCAAAAG CAGCCCAAGA TGCTTCAATA
CCATTTTTAG CCTACAATTG TAATCTACAA GAGTTGATTG ACCGGGACAT TCCTTGGGAA
GAAAATTTAA AGCACCCCTG GGATATAATA AACTATCTAG ATAAATTAAA AAATTAA
 
Protein sequence
MSSKVSIAQV GCQYSDDIIA AKMKQLFFDT GLNDYIAPNK QVLIKPNLVA VPPEDFRGAI 
THPLIVQKLS DIVRSLGGQV IIGDSSAVGV NTEDVISTTG YEKLRQQGYQ VIDLKQDSVV
DLQVPGGGKA LSQLPVAKTV KDVDLIISVP VMKTHDQVEV SLSIKNLKGL LPDKIKKAFH
NKYGLAKGVS DILATVPPVV SVLDATYALE GMGPVYGESV PMGLILASSD PVALDSIAAG
IMGLEEDELK IEGECYNRCL GELRRDKITI SGDVTDIDQV ARRFTRIKDL DYQFNVDFDL
IFNEEVCTGC KNTVMSSLDD IQTQGVEPYL SGKTVYAGPL TQGEISGSSN SILIGNCLYK
HKSQGTFVPG CPPENLPVIE GLVGEGKIAR RYTSENQSQF NHPWGIIYDL DNTLINSKIN
FNKMKVEVMN YLQEEQLLPE ITNLEKHTAA TLIQTARQHS TLDQEQEDGL WALITAIEAE
GMDKAETEPD INEVISTLAN EYTLIVLTNN SYKAAMKALK QFGLDEYFQL VVGREQMTSL
KPSPSGAEYI LEHFGDTKAE DWVMVGDSWI DAKAAQDASI PFLAYNCNLQ ELIDRDIPWE
ENLKHPWDII NYLDKLKN