Gene Nther_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0489 
Symbol 
ID6314682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp517473 
End bp519212 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content36% 
IMG OID642642873 
Productglycoside hydrolase family 18 
Protein accessionYP_001916673 
Protein GI188585128 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.101404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTTC AAAAAAAGAC TGTTTTAAAA TTTAATATTC CAACTTTAAT TTTAGGATTG 
ATTATTTTCA TTTTAATCTT GGCTAGTTTA CTTACTTATC TCTACCTCAC ACCTAGTCGC
CAAAAGGTAG AACCCTATCC GGACTCAGAA CAAACACGAC TCATTTACGA AGACCAAGAT
TTAGGAACTG AAAATACTGC TTTAGTAGAA GATGAAGTTT ATTTGTCAAA AAGTTTTATT
GAGAAGTCAT TAGACCCCTA CATATATTAC GATGATCAGG GTGACTACAT AACAATAACT
ACCAAAAATA AATTTTTCCA AATGCAAACA GATAATTTAA CTGCTGCAGT GAATGATGAA
GAAAGAGAAT TACAGTTCTC TCCAAAACTT ATCCAAGGTG AGCCTTTTCT TCCCTCGGAT
ATTTTGGAAG AACTCTACCC TATCACTGTC CAACATAATC AAGATGATAA TTTAGTCTTA
ATAGATAATT TAGAAAAACA GGTACTTAAA GGATATGCCA AAGAAAACGA AACAAGGGTT
AGAGAAGAAC CAAGTATTCG AACTGGAATC ATAGAAAGTT TGGACAAGCA ATCTAAGATA
ACCATATATC ATCAAAGTGG AGATTGGTTT TTTATACGGA CCGAAAAGGG TTATTTAGGT
TATGTTCAAA GTAGTGATAT AACTTTAGAT GAACTTTCAA TTAGAGAAAA ACATATTGAA
GACGATGATG TTGATCCCCC TAGCCGTCCC TTCGGGGAAC CCATTAATTT GACCTGGGAA
CATGTAATCA GTGAAAATCC AAATCCCAAT GATCTCCCTG ATTTACCAGG GGTGAATGTG
ATCAGTCCCA CCTGGTTTCA CTTGCAAGAT CCGGAAGGTA ATATTGTCAA CCAAGCAGAT
AAATCTTACG TAAATTGGGC CCATGATAAT GATATGCAAG TATGGGGGTT GTTCTCCAAT
AACTTTGATC CAGATCTAAC TTCTGAGTTT TTAGAAGACC CTGAGGCAAG AAGGAATGCA
ATTGAACAAA TTTTGCTTTA TTCTGATATA TATAATTTAG ACGGTATCAA CTTAGACTTT
GAAAATATTC ATATCGATGA TAGAGATGCC TATACTCAGT TTGTCAGAGA AATTTCGCCT
TTTTTAAGGG AACAGGGGTT AGTGGTTTCA ATTGATGTGA CCTTTATATC TCAAAGCGAA
AATTGGTCTT TAAGTTACGA TAGACATGCT TTTAGCAAGA CCGTAGACTA TATTATGGTT
ATGGCATATG ATGAACATTG GGGGGATAGC CCGGTAGCAG GTTCAGTATC AAGTCTTCCC
TGGGTAGAAG AGAATTTAGA AGAAATTTTA TCGGTGGTTC CCAATGATCA ACTTATCTTA
GGTATCCCTT TTTATACTAG ATTATGGGAA GAAGAAAAGA ACGATGATGG CTCAATTTCA
GTGTCTTCAT CTGCCTATGG TATGGAACAG ATCAAAAACA TATTAGAAAA CAATGATGCA
GATATCACCT TTGATGACGA TACTAAGCAA TATTACGCTG AATATGAAGA TAACGACACA
CGATACAGGA CCTGGCTTGA AACTGAAGAG TCAATTAAAA AAAGGTTAAA TTTAGTTCAT
GAATATAACT TGGCAGGTGT TGCCTCTTGG AGAAGAGGAT TTGAAAAAGA GGAAATCTGG
GAGGTAATTA AAAACAAGAT GGGAATCCCT AAGGATCCGG GATCTAAAAT AACTGATTGA
 
Protein sequence
MEVQKKTVLK FNIPTLILGL IIFILILASL LTYLYLTPSR QKVEPYPDSE QTRLIYEDQD 
LGTENTALVE DEVYLSKSFI EKSLDPYIYY DDQGDYITIT TKNKFFQMQT DNLTAAVNDE
ERELQFSPKL IQGEPFLPSD ILEELYPITV QHNQDDNLVL IDNLEKQVLK GYAKENETRV
REEPSIRTGI IESLDKQSKI TIYHQSGDWF FIRTEKGYLG YVQSSDITLD ELSIREKHIE
DDDVDPPSRP FGEPINLTWE HVISENPNPN DLPDLPGVNV ISPTWFHLQD PEGNIVNQAD
KSYVNWAHDN DMQVWGLFSN NFDPDLTSEF LEDPEARRNA IEQILLYSDI YNLDGINLDF
ENIHIDDRDA YTQFVREISP FLREQGLVVS IDVTFISQSE NWSLSYDRHA FSKTVDYIMV
MAYDEHWGDS PVAGSVSSLP WVEENLEEIL SVVPNDQLIL GIPFYTRLWE EEKNDDGSIS
VSSSAYGMEQ IKNILENNDA DITFDDDTKQ YYAEYEDNDT RYRTWLETEE SIKKRLNLVH
EYNLAGVASW RRGFEKEEIW EVIKNKMGIP KDPGSKITD