Gene Nther_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2075 
Symbol 
ID6316059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2194871 
End bp2196061 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content35% 
IMG OID642644463 
Producthydrolase (HAD superfamily) 
Protein accessionYP_001918230 
Protein GI188586685 
COG category[R] General function prediction only 
COG ID[COG1896] Predicted hydrolases of HD superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.957527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAAAG GTGAACTCTT AGAAACCCTA TTTGAAGCAG CCCATATTCA GAGGTGGAAT 
GATCACTTAA GACCTCACAA CTTTACTGAA TTAGATAAAC AAGCGCATAA AATGGTTTTA
GCTTATGTTA TAGGAAAATT TGAAGAACAA GACAAAGATG CGCGAATAGA TTGGTCTCAA
TTAATTGAAG GAGGAATTTT TGAATTTCTC CAGCGTATCA TGTTGACAGA TATAAAACCA
CCAATTTATC ACAAATTAAT GGAAGAAAGC GGTCGGGAAT TAAATCACTG GGTTTATGAA
CAACTGAAGG ATATTTTAAA ACCTGTAGCA GGAGACCTGA ATCAAAAGTT TGAACAATAC
CTGTTTGACG ACAATTACTC ATCTTATGAG AAAAAAATAC TACGGGCTTC TCATTATCTT
GCTACCAATT GGGAGTTCAA CGTTATATAC CAGTTTAATT CGCATATTTA CGGAGTTGAA
GAAACTAAAA ACAAAATTGA AAATGAATTA GAGGATCATT ATGATTTATT GGGTGTACAG
AAATTAGGTC TGAAAAACAA AACTTATAAC TTCATCGAGC TAGTAGGACA ACTAAGATTT
CAAAAAAGAT GGGCTAATTC ACCCAGGGTT CCGGAAACTT CTGTTCTAGG TCATATGTTA
CTAGTAGCTA TTTTTTCATA TCTATTTTCT CTTGAATTAT CAGCCTGTGA CAAGCGAAAA
TATAACAACT TTTTCGCTGG TCTTTATCAC GATCTACCGG AAGTCATGAC AAGAGATATC
ATCTCCCCAG TAAAAAAGTC AGTTAAGGGC TTAGATTCGC TCATCAACGA TATTGAAGAC
CGACAATTAG AAGAAAAAAT TCTCCCATTA CTGCCTAGAA AATGGCATGA AGAGCTCAGA
TATTTTTTAA ATGACGAGTT TGCTACAAAA ATATTTATTT CAGATACAAC CCAAAAAGTC
AGTACTGATG AAATTAATCG CTTCTATAAT AGGTATGAAT TTTCGCCCGT AGACGGAGAA
TTAATAAAAG TAGCCGATGA AATTTCCGCT TATATGGAAG CTTCTCAATC AATTCATCAC
GGGATAAGCT CAAAACACCT AGCTGAAGCC AAGGTAGAAT TATATAAAAA GTATCAGCAT
CAGATAATTT CAGGGATAGA GCTAGGAGCT TTATTCGACT ACTTTTACTA G
 
Protein sequence
MIKGELLETL FEAAHIQRWN DHLRPHNFTE LDKQAHKMVL AYVIGKFEEQ DKDARIDWSQ 
LIEGGIFEFL QRIMLTDIKP PIYHKLMEES GRELNHWVYE QLKDILKPVA GDLNQKFEQY
LFDDNYSSYE KKILRASHYL ATNWEFNVIY QFNSHIYGVE ETKNKIENEL EDHYDLLGVQ
KLGLKNKTYN FIELVGQLRF QKRWANSPRV PETSVLGHML LVAIFSYLFS LELSACDKRK
YNNFFAGLYH DLPEVMTRDI ISPVKKSVKG LDSLINDIED RQLEEKILPL LPRKWHEELR
YFLNDEFATK IFISDTTQKV STDEINRFYN RYEFSPVDGE LIKVADEISA YMEASQSIHH
GISSKHLAEA KVELYKKYQH QIISGIELGA LFDYFY