Gene Information |
Locus tag | Nther_0571 |
Symbol | |
ID | 6314713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 598571 |
End bp | 600427 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642642954 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 1 |
Protein accession | YP_001916754 |
Protein GI | 188585209 |
COG category | [R] General function prediction only; [S] Function unknown |
COG ID | [COG0546] Predicted phosphatases; [COG2006] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
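The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes a single terminal stop codon. The record does not state either convention, but both are consistent with the listed values. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 598571
END_BP = 600427
GENE_LENGTH_BP = 1857
PROTEIN_LENGTH_AA = 618


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # 1857 618
```

Both computed values agree with the Gene Length and Protein Length fields, which is what one would expect for a gene marked as neither spliced nor pseudo.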
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAGCA AAGTTTCTAT AGCCCAAGTT GGTTGCCAGT ATAGTGATGA CATTATCGCT GCTAAAATGA AGCAGCTATT TTTTGACACG GGTTTAAATG ATTATATAGC TCCAAATAAA CAGGTTCTGA TTAAGCCCAA CCTAGTGGCG GTTCCCCCGG AAGATTTTCG AGGAGCCATT ACTCATCCTT TAATAGTTCA AAAACTGTCA GATATAGTCC GATCCTTGGG AGGACAAGTT ATTATTGGTG ACTCATCAGC AGTAGGGGTC AACACAGAAG ATGTTATCTC TACTACGGGA TATGAGAAGC TAAGACAGCA GGGCTATCAA GTTATTGATT TAAAACAGGA TTCGGTAGTA GACCTTCAAG TTCCAGGAGG TGGAAAAGCA CTGTCCCAGT TGCCTGTAGC CAAAACAGTC AAAGACGTAG ATCTAATAAT TTCAGTACCA GTTATGAAAA CTCATGATCA GGTAGAAGTT AGCCTGAGTA TCAAAAATCT AAAAGGACTA CTACCAGATA AAATAAAAAA AGCATTTCAT AATAAATATG GCCTAGCTAA AGGAGTATCA GATATACTGG CAACTGTGCC TCCAGTGGTT TCAGTCCTAG ATGCCACCTA TGCATTAGAG GGTATGGGGC CTGTCTATGG AGAATCTGTA CCCATGGGAC TTATTTTAGC CAGCAGCGAT CCTGTGGCCT TAGATTCAAT AGCTGCAGGT ATAATGGGTT TGGAGGAAGA TGAACTTAAG ATTGAGGGAG AATGTTATAA TAGGTGTTTA GGTGAATTGA GACGAGATAA GATCACCATC TCAGGTGATG TGACAGATAT AGATCAAGTA GCGAGAAGAT TTACTCGGAT CAAAGATTTG GATTATCAAT TTAATGTTGA CTTTGATTTA ATATTTAATG AGGAAGTTTG TACAGGCTGT AAAAATACAG TTATGAGCTC CTTAGATGAT ATCCAAACTC AGGGAGTGGA GCCATATTTA TCGGGAAAAA CTGTATATGC CGGTCCATTA ACACAAGGTG AGATTAGTGG TAGCTCCAAC TCTATACTGA TCGGAAATTG CCTATACAAG CATAAAAGTC AAGGAACTTT TGTGCCCGGA TGTCCGCCGG AAAATTTACC TGTAATTGAA GGCCTTGTAG GTGAAGGAAA GATTGCCAGA CGTTATACTA GTGAAAATCA AAGTCAGTTT AATCATCCTT GGGGCATTAT CTACGATTTA GATAATACAT TGATCAACTC AAAAATAAAT TTTAATAAAA TGAAAGTAGA AGTCATGAAT TATCTCCAAG AAGAACAGCT TTTGCCCGAA ATTACTAACC TTGAGAAACA TACTGCTGCT ACCTTGATTC AAACTGCTCG TCAACATTCC ACTTTGGATC AGGAACAAGA AGATGGATTG TGGGCTTTAA TTACTGCCAT TGAAGCTGAA GGCATGGATA AGGCCGAAAC AGAACCTGAT ATCAATGAAG TGATTTCAAC TTTAGCTAAT GAGTATACTC TGATCGTATT GACTAATAAC TCATATAAAG CTGCTATGAA AGCTTTGAAA CAATTTGGCT TAGATGAATA CTTTCAATTG GTGGTAGGTA GGGAGCAGAT GACTAGTTTG AAACCATCTC CTTCTGGTGC GGAATACATC TTAGAGCATT TTGGTGATAC TAAAGCTGAA GATTGGGTCA TGGTAGGAGA TTCTTGGATA GACGCAAAAG CAGCCCAAGA TGCTTCAATA CCATTTTTAG CCTACAATTG TAATCTACAA GAGTTGATTG ACCGGGACAT TCCTTGGGAA 
GAAAATTTAA AGCACCCCTG GGATATAATA AACTATCTAG ATAAATTAAA AAATTAA
|
Protein sequence | MSSKVSIAQV GCQYSDDIIA AKMKQLFFDT GLNDYIAPNK QVLIKPNLVA VPPEDFRGAI THPLIVQKLS DIVRSLGGQV IIGDSSAVGV NTEDVISTTG YEKLRQQGYQ VIDLKQDSVV DLQVPGGGKA LSQLPVAKTV KDVDLIISVP VMKTHDQVEV SLSIKNLKGL LPDKIKKAFH NKYGLAKGVS DILATVPPVV SVLDATYALE GMGPVYGESV PMGLILASSD PVALDSIAAG IMGLEEDELK IEGECYNRCL GELRRDKITI SGDVTDIDQV ARRFTRIKDL DYQFNVDFDL IFNEEVCTGC KNTVMSSLDD IQTQGVEPYL SGKTVYAGPL TQGEISGSSN SILIGNCLYK HKSQGTFVPG CPPENLPVIE GLVGEGKIAR RYTSENQSQF NHPWGIIYDL DNTLINSKIN FNKMKVEVMN YLQEEQLLPE ITNLEKHTAA TLIQTARQHS TLDQEQEDGL WALITAIEAE GMDKAETEPD INEVISTLAN EYTLIVLTNN SYKAAMKALK QFGLDEYFQL VVGREQMTSL KPSPSGAEYI LEHFGDTKAE DWVMVGDSWI DAKAAQDASI PFLAYNCNLQ ELIDRDIPWE ENLKHPWDII NYLDKLKN
|
| |
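As a consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the helper names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; "*" marks a stop codon.
# Translation table 11 uses these same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt of the gene sequence and first 10 aa of the protein sequence.
GENE_PREFIX = "ATGTCAAGCA AAGTTTCTAT AGCCCAAGTT"
PROTEIN_PREFIX = "MSSKVSIAQV"

# Last 9 nt of the gene sequence.
GENE_SUFFIX = "AAAAATTAA"

print(translate(GENE_PREFIX))  # MSSKVSIAQV
print(translate(GENE_SUFFIX))  # KN*
```

The translated prefix matches the first ten residues of the protein sequence, and the final codons give K, N, and a stop, matching the C-terminal "KN" of the protein.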