Gene Nmar_0848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0848 
Symbol 
ID5774220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp747812 
End bp749020 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content37% 
IMG OID641316486 
Productthreonine dehydratase 
Protein accessionYP_001582182 
Protein GI161528356 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00000460563 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAACCAA CATTTGACGA GATACAAAAA GCAAACTCTA TGCGAGGAAA TGAAATAAAG 
AAAACTCCTT TAATTCATTC ACCTACTTTT AGTGAGCTTA CAAAGTCTGA AGTTTATCTT
AAAGCAGAGT TCCGACAAAA AACTGGTTCA TTTAAAATTC GTGGCGCTTA TTACAAAATC
AAATCATTAT CTGATGAAGA AAAGAAACAA GGAGTGGTTG CAGCATCTGC TGGAAATCAT
GCACAAGGTG TTGCATTAGC TTCAGCACTT GAAGAAATTC CTTGTACTAT AGTTATGCCA
AAAAATGCAT CCCCTGCAAA AGTGGCTGCA ACAAAAGGCT ATGGTGCAAA TGTGGTTCTA
GAAGGTGTAA ACTATGATGA ATCTTCTGCA AAAGCAAAAG AGATTGCAAA AGAAACTGGG
GCAACTATGA TACATGCATT TGATGATCCT CAAATTATTG CAGCACAGGG TGTAATTGGT
TTAGAAATAC TAGAGGACTT GCCAGATGTT GATCAAGTGT ATCTTCCAAT AGGTGGTGGA
GGATTGGCTG CAGGTACCTT AATTGCAATT AAAGAAAAAA ACCCCAATGT TCAGGTTATT
GGAGTACAAT CAAGATCTTT TCCATCAATG TATGAATCAG TAAAACAAGG ATCAATCACT
GCAAGTGGAG GTGCAAGAAC AATTGCAGAT GGTATATCAG TAAAAGTTCC AGGACAGTTA
ACGTTTAGTG TAATTAATGA ACTCATAGAC GAAGTTGTTC TAGTAGATGA TACTGAAATT
ACAAAAGCAA TGTTTCTCTT AATGGAGAGG ATGAAATTTG TAGTAGAACC CGCAGGTGCT
GCCAGTTTAG CTTATCTAAT TTCAAAAAAA CCTGCCCCAG GCAAAAAAGT AGTTGCAGTA
TTGGCAGGAG GAAATGTGGA TATGTATCTC TTGGGACAAA TAGTAGACAA AGGTCTTGCT
GCTATGGGTA GATTATTGAA ATTATCAGTA TTGCTACCTG ACAGACCAGG TTCATTCAAA
GAAATTGTTG ATGTCATTAC CCTTGCAAAT GCCAACATTG TAGAAGTTGT TCATGACAGA
TTAAGTTCAA ATGTTAATGC AGGTTCGGCA TCAGTTACTA TGAATTTAGA AACTCAAGGA
AAAGAGCAAG CAGATGCGTT AATTGAAGCA TTAAGAAAGA AAGATGTTCA ATTCACATTA
TTGACATAA
 
Protein sequence
MEPTFDEIQK ANSMRGNEIK KTPLIHSPTF SELTKSEVYL KAEFRQKTGS FKIRGAYYKI 
KSLSDEEKKQ GVVAASAGNH AQGVALASAL EEIPCTIVMP KNASPAKVAA TKGYGANVVL
EGVNYDESSA KAKEIAKETG ATMIHAFDDP QIIAAQGVIG LEILEDLPDV DQVYLPIGGG
GLAAGTLIAI KEKNPNVQVI GVQSRSFPSM YESVKQGSIT ASGGARTIAD GISVKVPGQL
TFSVINELID EVVLVDDTEI TKAMFLLMER MKFVVEPAGA ASLAYLISKK PAPGKKVVAV
LAGGNVDMYL LGQIVDKGLA AMGRLLKLSV LLPDRPGSFK EIVDVITLAN ANIVEVVHDR
LSSNVNAGSA SVTMNLETQG KEQADALIEA LRKKDVQFTL LT