Gene Information |
Locus tag | Nther_0489 |
Symbol | |
ID | 6314682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 517473 |
End bp | 519212 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642642873 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_001916673 |
Protein GI | 188585128 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
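The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(517473, 519212)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```

Both values agree with the record: 1740 bp and 579 aa.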
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.101404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGTTC AAAAAAAGAC TGTTTTAAAA TTTAATATTC CAACTTTAAT TTTAGGATTG ATTATTTTCA TTTTAATCTT GGCTAGTTTA CTTACTTATC TCTACCTCAC ACCTAGTCGC CAAAAGGTAG AACCCTATCC GGACTCAGAA CAAACACGAC TCATTTACGA AGACCAAGAT TTAGGAACTG AAAATACTGC TTTAGTAGAA GATGAAGTTT ATTTGTCAAA AAGTTTTATT GAGAAGTCAT TAGACCCCTA CATATATTAC GATGATCAGG GTGACTACAT AACAATAACT ACCAAAAATA AATTTTTCCA AATGCAAACA GATAATTTAA CTGCTGCAGT GAATGATGAA GAAAGAGAAT TACAGTTCTC TCCAAAACTT ATCCAAGGTG AGCCTTTTCT TCCCTCGGAT ATTTTGGAAG AACTCTACCC TATCACTGTC CAACATAATC AAGATGATAA TTTAGTCTTA ATAGATAATT TAGAAAAACA GGTACTTAAA GGATATGCCA AAGAAAACGA AACAAGGGTT AGAGAAGAAC CAAGTATTCG AACTGGAATC ATAGAAAGTT TGGACAAGCA ATCTAAGATA ACCATATATC ATCAAAGTGG AGATTGGTTT TTTATACGGA CCGAAAAGGG TTATTTAGGT TATGTTCAAA GTAGTGATAT AACTTTAGAT GAACTTTCAA TTAGAGAAAA ACATATTGAA GACGATGATG TTGATCCCCC TAGCCGTCCC TTCGGGGAAC CCATTAATTT GACCTGGGAA CATGTAATCA GTGAAAATCC AAATCCCAAT GATCTCCCTG ATTTACCAGG GGTGAATGTG ATCAGTCCCA CCTGGTTTCA CTTGCAAGAT CCGGAAGGTA ATATTGTCAA CCAAGCAGAT AAATCTTACG TAAATTGGGC CCATGATAAT GATATGCAAG TATGGGGGTT GTTCTCCAAT AACTTTGATC CAGATCTAAC TTCTGAGTTT TTAGAAGACC CTGAGGCAAG AAGGAATGCA ATTGAACAAA TTTTGCTTTA TTCTGATATA TATAATTTAG ACGGTATCAA CTTAGACTTT GAAAATATTC ATATCGATGA TAGAGATGCC TATACTCAGT TTGTCAGAGA AATTTCGCCT TTTTTAAGGG AACAGGGGTT AGTGGTTTCA ATTGATGTGA CCTTTATATC TCAAAGCGAA AATTGGTCTT TAAGTTACGA TAGACATGCT TTTAGCAAGA CCGTAGACTA TATTATGGTT ATGGCATATG ATGAACATTG GGGGGATAGC CCGGTAGCAG GTTCAGTATC AAGTCTTCCC TGGGTAGAAG AGAATTTAGA AGAAATTTTA TCGGTGGTTC CCAATGATCA ACTTATCTTA GGTATCCCTT TTTATACTAG ATTATGGGAA GAAGAAAAGA ACGATGATGG CTCAATTTCA GTGTCTTCAT CTGCCTATGG TATGGAACAG ATCAAAAACA TATTAGAAAA CAATGATGCA GATATCACCT TTGATGACGA TACTAAGCAA TATTACGCTG AATATGAAGA TAACGACACA CGATACAGGA CCTGGCTTGA AACTGAAGAG TCAATTAAAA AAAGGTTAAA TTTAGTTCAT GAATATAACT TGGCAGGTGT TGCCTCTTGG AGAAGAGGAT TTGAAAAAGA GGAAATCTGG GAGGTAATTA AAAACAAGAT GGGAATCCCT AAGGATCCGG GATCTAAAAT AACTGATTGA
|
Protein sequence | MEVQKKTVLK FNIPTLILGL IIFILILASL LTYLYLTPSR QKVEPYPDSE QTRLIYEDQD LGTENTALVE DEVYLSKSFI EKSLDPYIYY DDQGDYITIT TKNKFFQMQT DNLTAAVNDE ERELQFSPKL IQGEPFLPSD ILEELYPITV QHNQDDNLVL IDNLEKQVLK GYAKENETRV REEPSIRTGI IESLDKQSKI TIYHQSGDWF FIRTEKGYLG YVQSSDITLD ELSIREKHIE DDDVDPPSRP FGEPINLTWE HVISENPNPN DLPDLPGVNV ISPTWFHLQD PEGNIVNQAD KSYVNWAHDN DMQVWGLFSN NFDPDLTSEF LEDPEARRNA IEQILLYSDI YNLDGINLDF ENIHIDDRDA YTQFVREISP FLREQGLVVS IDVTFISQSE NWSLSYDRHA FSKTVDYIMV MAYDEHWGDS PVAGSVSSLP WVEENLEEIL SVVPNDQLIL GIPFYTRLWE EEKNDDGSIS VSSSAYGMEQ IKNILENNDA DITFDDDTKQ YYAEYEDNDT RYRTWLETEE SIKKRLNLVH EYNLAGVASW RRGFEKEEIW EVIKNKMGIP KDPGSKITD
|
| |
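The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline used to produce the record. It assumes the gene sequence is displayed in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons, which table 11 permits, are not handled. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGAAGTTC AAAAAAAGAC TGTTTTAAAA"))  # MEVQKKTVLK
print(translate("AAAATAACTG ATTGA"))                  # KITD
```

The two outputs match the first ten and last four residues of the protein sequence listed in the record.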