Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1118 |
Symbol | |
ID | 6315347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1184587 |
End bp | 1185753 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642643490 |
Product | amidohydrolase |
Protein accession | YP_001917289 |
Protein GI | 188585744 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00577385 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.373406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTTCAG AAATAAAAAC TGAGGTTATT AAAGAAATAA AGAACTTAGA AAAAGAACTT GGCTCAATTG CCGATTTTAT CCATCAAAAT CCCGAACCTG GACTTGAAGA ATTTCAGGCA GTGAAACTCT TAACTAATAA ATTATCCCAG GAAGGTTTTG AAGTACAACA ACCAATCGCC GGACTTGAAA CTGCTTTTAA GGCCTCCTAT CAATCACAAC ACTCCCCGTA TCCTAAGATC GGATTTTTAG CGGAGTACGA TGCTTTACCT GAAGGACATT CATGTGGGCA TAATTTAATT GCTGCCATGA GTTACGGGGC TGGTGTAGCT CTAAAACGAT TGCTAGATAA ATTTCAAGGT GGAACCATTG AGATATACGG TACACCTGCC GAGGAAACTG ATGGTGCCAA GGTCACTATG GTGGAACAAG GAATTTTTAA CCATTTAGAT GCCGCCTTAA TCTGTCATCC TGGTAGCAAA AATATGGTCC TAGATAGTTC ATTAGCTATG GACGCTATAG AATTTAAATT TTATGGAAAA GCAGCTCATG CAGCTGCGGC TCCTCATGAA GGAATCAATG CACTAGATGC GGTAATTTCA CTCTTTAATA ATATTAACTC CTTACGCCAA CAATTAACAA CAGATGTACG TATTCACGGA ATCATCACAG AAGGGGGATC TGCCCCTAAT ATCATTCCTG AAAAAGGAGT AGCCCGATTC TATGTTAGAG CATCTGAAAG AGATTACTTA AATCAAGTGG TTTCAAAGGT GATTAATTGT GCCAGTGGTG CTGCCCAAGC TACAGGCTGT CAATATGAAT ATGATTATTT TGAATTATCC TTCGATAACA TGATAACTAA CAAAACCCTG GCAGATTCAT TTCAACAAAA TTTAAAAGAA CTGGGTGCAG TAATCCACGC CCCAGGGGGT AATTTTGGAT CAACAGATAT GGGTAATGTC AGCCACGTCA CACCCTCAAT CCACCCTTTC ATTTCAATAA GTTCTCGAGA TATTGCAGCA CATACAGATG CCTTTAAAGA AGCAGCTGGC TCTAAGGAAG GTAAACAAGG CATGTTATTG GGAGCCAAAG CTCTAGCAAT GACTGGTGCC GATTTGTTAG TTCAACCGGA GCTAATTGAC CGGATTAAAG CCGATTTTGA AAATTGA
|
Protein sequence | MSSEIKTEVI KEIKNLEKEL GSIADFIHQN PEPGLEEFQA VKLLTNKLSQ EGFEVQQPIA GLETAFKASY QSQHSPYPKI GFLAEYDALP EGHSCGHNLI AAMSYGAGVA LKRLLDKFQG GTIEIYGTPA EETDGAKVTM VEQGIFNHLD AALICHPGSK NMVLDSSLAM DAIEFKFYGK AAHAAAAPHE GINALDAVIS LFNNINSLRQ QLTTDVRIHG IITEGGSAPN IIPEKGVARF YVRASERDYL NQVVSKVINC ASGAAQATGC QYEYDYFELS FDNMITNKTL ADSFQQNLKE LGAVIHAPGG NFGSTDMGNV SHVTPSIHPF ISISSRDIAA HTDAFKEAAG SKEGKQGMLL GAKALAMTGA DLLVQPELID RIKADFEN
|
| |