Gene Nther_2817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2817 
Symbol 
ID6316811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp3045893 
End bp3047164 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content41% 
IMG OID642645189 
Productimidazolonepropionase 
Protein accessionYP_001918953 
Protein GI188587408 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAAAG ATCTAAAAGC TGTTGATAAA ATAGTTATTA ACGCAGGAGA GCTAGTAACA 
GCAGCTGGTA ATAGTGACCG CCCCAAAGTT GGGACAGAAC TTAACAACCT CGGTTTAATT
CAAAATGGGG CTGTAGCTAT TAAAGGTGAC AGAATTGAAG CTGTGGGGAC TACCGAGTCA
GTAATTGAAC AAGTGGAAGT CACAGCTAAA ACTAAAGTGA TAGACGCCCG TGGGAAAACA
GTAATGCCTG GTTTTGTTGA CCCACACACT CACATTATAT TTGGCTCTAC CAGGGAAAAA
GAACTGGGAT TGCGTATCGA AGGAGCGGAA TATTTAGAAA TATTAAAAGC CGGAGGAGGC
ATTTTAGGTA CTGTCGAATC TACGCGCCAA ACATCGGAAG ATGAATTATA TTATGCTGGA
AAACAGCGAT TAGACACTTT TTTAAAAGAG GGAGTAACAA CCGTAGAATC CAAAAGTGGC
TATGGCCTTG ATACAGCAAC AGAACTTAAA CAGCTAAGAG TTGCTAACAA ACTTGGGAAA
ACTCATCCCG CAGATGTAGT TCATACATTT TTGGGAGCCC ATGCCATTCC CAAAGAGCAT
AAAGATAACC CGGACAAGTA TATAGAAATA GTTGTAGAAG AGATGTTGCC CCGGGTTATA
GAAGAAAATC TGGCGGAATT TTGTGATGTG TTTTGTGAAG AAGGTGTATT CAACATAGAT
CAGTCCAGGA AAGTCTTACA AACTGCCAGG ACAAAAGGTA TATTGCCCAA AATCCATGCC
GATGAGATTA ATCCCCTGGG AGGGGCAGAA CTGGCCGCAG AATTAGGTGC CGTCTCTGCA
GATCATCTTG GTAAAGCTAC AGATCGAGGT ATAAAAGAAA TGGCAGAAAA AGGAGTAGTG
GCTGTTCTAC TTCCCGGGAC ACTGTTTTTC TTAATGAAGG ATGAGTATGC TAGAGGCAGA
AGAATGGTTG AAGAAGGAGT TCCTGTAGCC TTATCCACGG ATAGAAATCC TGGTTCATCA
CCAACTGAAT CTATGAGTTT AATTGTTTCT CTTGCCTGTT TAAAAATGAA ATTACTCCCC
AGTGAGGCGA TCAATGCAGC TACTATAAAT GCTGCACATG CCATTAACAG GGGACATCTC
ATTGGAAGTA TCGAAAAAGG TAAACAAGCT GATATTGTGA TTTTTGATAT GCCGAACCAT
GAGTACTTGC CTTATCATTA TGGGATTAAT CACGTGGAAC GTGTGATTAA GAAAGGTACT
ATTGTGGTTT GA
 
Protein sequence
MVKDLKAVDK IVINAGELVT AAGNSDRPKV GTELNNLGLI QNGAVAIKGD RIEAVGTTES 
VIEQVEVTAK TKVIDARGKT VMPGFVDPHT HIIFGSTREK ELGLRIEGAE YLEILKAGGG
ILGTVESTRQ TSEDELYYAG KQRLDTFLKE GVTTVESKSG YGLDTATELK QLRVANKLGK
THPADVVHTF LGAHAIPKEH KDNPDKYIEI VVEEMLPRVI EENLAEFCDV FCEEGVFNID
QSRKVLQTAR TKGILPKIHA DEINPLGGAE LAAELGAVSA DHLGKATDRG IKEMAEKGVV
AVLLPGTLFF LMKDEYARGR RMVEEGVPVA LSTDRNPGSS PTESMSLIVS LACLKMKLLP
SEAINAATIN AAHAINRGHL IGSIEKGKQA DIVIFDMPNH EYLPYHYGIN HVERVIKKGT
IVV