Gene Information |
Locus tag | Nmar_0848 |
Symbol | |
ID | 5774220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 747812 |
End bp | 749020 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316486 |
Product | threonine dehydratase |
Protein accession | YP_001582182 |
Protein GI | 161528356 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01127] threonine dehydratase, medium form |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00000460563 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGAACCAA CATTTGACGA GATACAAAAA GCAAACTCTA TGCGAGGAAA TGAAATAAAG AAAACTCCTT TAATTCATTC ACCTACTTTT AGTGAGCTTA CAAAGTCTGA AGTTTATCTT AAAGCAGAGT TCCGACAAAA AACTGGTTCA TTTAAAATTC GTGGCGCTTA TTACAAAATC AAATCATTAT CTGATGAAGA AAAGAAACAA GGAGTGGTTG CAGCATCTGC TGGAAATCAT GCACAAGGTG TTGCATTAGC TTCAGCACTT GAAGAAATTC CTTGTACTAT AGTTATGCCA AAAAATGCAT CCCCTGCAAA AGTGGCTGCA ACAAAAGGCT ATGGTGCAAA TGTGGTTCTA GAAGGTGTAA ACTATGATGA ATCTTCTGCA AAAGCAAAAG AGATTGCAAA AGAAACTGGG GCAACTATGA TACATGCATT TGATGATCCT CAAATTATTG CAGCACAGGG TGTAATTGGT TTAGAAATAC TAGAGGACTT GCCAGATGTT GATCAAGTGT ATCTTCCAAT AGGTGGTGGA GGATTGGCTG CAGGTACCTT AATTGCAATT AAAGAAAAAA ACCCCAATGT TCAGGTTATT GGAGTACAAT CAAGATCTTT TCCATCAATG TATGAATCAG TAAAACAAGG ATCAATCACT GCAAGTGGAG GTGCAAGAAC AATTGCAGAT GGTATATCAG TAAAAGTTCC AGGACAGTTA ACGTTTAGTG TAATTAATGA ACTCATAGAC GAAGTTGTTC TAGTAGATGA TACTGAAATT ACAAAAGCAA TGTTTCTCTT AATGGAGAGG ATGAAATTTG TAGTAGAACC CGCAGGTGCT GCCAGTTTAG CTTATCTAAT TTCAAAAAAA CCTGCCCCAG GCAAAAAAGT AGTTGCAGTA TTGGCAGGAG GAAATGTGGA TATGTATCTC TTGGGACAAA TAGTAGACAA AGGTCTTGCT GCTATGGGTA GATTATTGAA ATTATCAGTA TTGCTACCTG ACAGACCAGG TTCATTCAAA GAAATTGTTG ATGTCATTAC CCTTGCAAAT GCCAACATTG TAGAAGTTGT TCATGACAGA TTAAGTTCAA ATGTTAATGC AGGTTCGGCA TCAGTTACTA TGAATTTAGA AACTCAAGGA AAAGAGCAAG CAGATGCGTT AATTGAAGCA TTAAGAAAGA AAGATGTTCA ATTCACATTA TTGACATAA
|
Protein sequence | MEPTFDEIQK ANSMRGNEIK KTPLIHSPTF SELTKSEVYL KAEFRQKTGS FKIRGAYYKI KSLSDEEKKQ GVVAASAGNH AQGVALASAL EEIPCTIVMP KNASPAKVAA TKGYGANVVL EGVNYDESSA KAKEIAKETG ATMIHAFDDP QIIAAQGVIG LEILEDLPDV DQVYLPIGGG GLAAGTLIAI KEKNPNVQVI GVQSRSFPSM YESVKQGSIT ASGGARTIAD GISVKVPGQL TFSVINELID EVVLVDDTEI TKAMFLLMER MKFVVEPAGA ASLAYLISKK PAPGKKVVAV LAGGNVDMYL LGQIVDKGLA AMGRLLKLSV LLPDRPGSFK EIVDVITLAN ANIVEVVHDR LSSNVNAGSA SVTMNLETQG KEQADALIEA LRKKDVQFTL LT
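
The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the stop codon (both consistent with the values shown, but not stated by the record). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose sequence includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop, not a residue


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(747812, 749020)   # 1209, matches "Gene Length"
length_aa = protein_length(length_bp)     # 402, matches "Protein Length"
```

Passing the Gene sequence field to `gc_percent` as a single string (the space-separated 10-base blocks are handled) gives a value to compare against the reported GC content.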