Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2817 |
Symbol | |
ID | 6316811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 3045893 |
End bp | 3047164 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642645189 |
Product | imidazolonepropionase |
Protein accession | YP_001918953 |
Protein GI | 188587408 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAAG ATCTAAAAGC TGTTGATAAA ATAGTTATTA ACGCAGGAGA GCTAGTAACA GCAGCTGGTA ATAGTGACCG CCCCAAAGTT GGGACAGAAC TTAACAACCT CGGTTTAATT CAAAATGGGG CTGTAGCTAT TAAAGGTGAC AGAATTGAAG CTGTGGGGAC TACCGAGTCA GTAATTGAAC AAGTGGAAGT CACAGCTAAA ACTAAAGTGA TAGACGCCCG TGGGAAAACA GTAATGCCTG GTTTTGTTGA CCCACACACT CACATTATAT TTGGCTCTAC CAGGGAAAAA GAACTGGGAT TGCGTATCGA AGGAGCGGAA TATTTAGAAA TATTAAAAGC CGGAGGAGGC ATTTTAGGTA CTGTCGAATC TACGCGCCAA ACATCGGAAG ATGAATTATA TTATGCTGGA AAACAGCGAT TAGACACTTT TTTAAAAGAG GGAGTAACAA CCGTAGAATC CAAAAGTGGC TATGGCCTTG ATACAGCAAC AGAACTTAAA CAGCTAAGAG TTGCTAACAA ACTTGGGAAA ACTCATCCCG CAGATGTAGT TCATACATTT TTGGGAGCCC ATGCCATTCC CAAAGAGCAT AAAGATAACC CGGACAAGTA TATAGAAATA GTTGTAGAAG AGATGTTGCC CCGGGTTATA GAAGAAAATC TGGCGGAATT TTGTGATGTG TTTTGTGAAG AAGGTGTATT CAACATAGAT CAGTCCAGGA AAGTCTTACA AACTGCCAGG ACAAAAGGTA TATTGCCCAA AATCCATGCC GATGAGATTA ATCCCCTGGG AGGGGCAGAA CTGGCCGCAG AATTAGGTGC CGTCTCTGCA GATCATCTTG GTAAAGCTAC AGATCGAGGT ATAAAAGAAA TGGCAGAAAA AGGAGTAGTG GCTGTTCTAC TTCCCGGGAC ACTGTTTTTC TTAATGAAGG ATGAGTATGC TAGAGGCAGA AGAATGGTTG AAGAAGGAGT TCCTGTAGCC TTATCCACGG ATAGAAATCC TGGTTCATCA CCAACTGAAT CTATGAGTTT AATTGTTTCT CTTGCCTGTT TAAAAATGAA ATTACTCCCC AGTGAGGCGA TCAATGCAGC TACTATAAAT GCTGCACATG CCATTAACAG GGGACATCTC ATTGGAAGTA TCGAAAAAGG TAAACAAGCT GATATTGTGA TTTTTGATAT GCCGAACCAT GAGTACTTGC CTTATCATTA TGGGATTAAT CACGTGGAAC GTGTGATTAA GAAAGGTACT ATTGTGGTTT GA
|
Protein sequence | MVKDLKAVDK IVINAGELVT AAGNSDRPKV GTELNNLGLI QNGAVAIKGD RIEAVGTTES VIEQVEVTAK TKVIDARGKT VMPGFVDPHT HIIFGSTREK ELGLRIEGAE YLEILKAGGG ILGTVESTRQ TSEDELYYAG KQRLDTFLKE GVTTVESKSG YGLDTATELK QLRVANKLGK THPADVVHTF LGAHAIPKEH KDNPDKYIEI VVEEMLPRVI EENLAEFCDV FCEEGVFNID QSRKVLQTAR TKGILPKIHA DEINPLGGAE LAAELGAVSA DHLGKATDRG IKEMAEKGVV AVLLPGTLFF LMKDEYARGR RMVEEGVPVA LSTDRNPGSS PTESMSLIVS LACLKMKLLP SEAINAATIN AAHAINRGHL IGSIEKGKQA DIVIFDMPNH EYLPYHYGIN HVERVIKKGT IVV
|
| |