Gene Nmar_0833 details

Gene Information

Locus tag: Nmar_0833
Symbol:
ID: 5774156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 736555
End bp: 737892
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 36%
IMG OID: 641316471
Product: threonine synthase
Protein accession: YP_001582167
Protein GI: 161528341
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
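
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon; the record does not state either convention, but both agree with the reported values. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Number of codons minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(736555, 737892))  # (1338, 445)
```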


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.000348992
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGGAG ACGCATATCT CAAGTGTATT GATCCACAAT GCGGTATGGA ATACCCTATT 
GAGAGCACAA ATGTTCAATG TGAAAAAGGA CATCTGCTTG ATGTAAAATA CAAACATACT
CCCTCAGAAA ATCTTAAAGA AGTTTTTTAC AATAGAAGAA ACTCGCAAGG GAACATATTC
AATGAAAGTG GAGTTTGGAG ATTCAGAGAA TTACTAAACT TTTGTCAAAT AGATACTGAA
AACATTGAAG AGTGTTCAAA ATATTTGGTT TCACTTGATG GTGCAGAAGG AAGACTCTCA
AAGCCTTACC ATATGGCAAA GGCTTCAGAA CTTGTAGGAA TTTCAAATGA TAATTTATGG
TTACAGCCTG AAGGATACAA TCCAAGTGGT TCTTTCAAAG ATAATGGCAT GGCAACTGCA
GTAACTCATG CAAAAATGGT AGGTGCAAAA AAGATTGTTT GTGCATCTAC TGGAAACACT
TCAGCATCTG CAGGTATGTT TGCAGCAAAT GAAGGGATAA ATTGTGATGT ATACATTCCC
GCAGGACAAA TTGCTCCAGG AAAATTAAGC CAAGCGTATC AGTTTGGAGC TCAGATTTTA
GAAATTGATG GAAATTTTGA TGATGCTCTA AAACAATCAT TAGATGATGC ACAGAATCAT
GATGGGTATA CTGTAAATTC TGTTAATCCA TTTAGAATTG AAGGACAAAA AACCATACCA
TTTAGAGCAT TAGAATATTT GAATTGGGAA GTTCCAGATT GGATTGTTTA TCCAGGTGGT
GCATTAGGGA ACACATCTAG TTGTGGAAAA GCATTGATGG AACTATATGA ATGGGGATGG
ATTAAAAAAA TTCCAAGAAT TGCAGTGATA AACTCTGAAG GTGCAAGTAC ATTATCTGAT
TTGTATAATG GTAAATTTGA AGGAGAGGAA TTAAGATGGA ATAAAGGAAA CCCCAATACT
GAATTAATTA CAAGATATTA TGATGATTTA GACTCAAAAG GAATTAGACC AAAAACCAAA
GCTACTGCAA TTCAAATTGG TAGACCTGCA AATATTTTGA AAGGATTACG CGCACTTGAA
TTTACAAACG GTGTTGCAAC AACTGTTTCA GATTCTGAAA TGCTTGATGG AATGTCAGTA
GTAGGACTAA ACGGTTTTGA TTGTGAGATG GCATCAGGTG CATCAGTGGT TGGAGTTAAG
AAATTGACGA GTGAGGGAAT AATACAAAAA GACGACACAG TGGTTGGAAT TCTTACAGGC
AGACAAAAAG ATGCAATGCT TCCAGTAGAT TATCACAATA ATCCCCAAAA CAAGTTTGCA
AAACCTCCAA AAAATTAG
 
Protein sequence
MQGDAYLKCI DPQCGMEYPI ESTNVQCEKG HLLDVKYKHT PSENLKEVFY NRRNSQGNIF 
NESGVWRFRE LLNFCQIDTE NIEECSKYLV SLDGAEGRLS KPYHMAKASE LVGISNDNLW
LQPEGYNPSG SFKDNGMATA VTHAKMVGAK KIVCASTGNT SASAGMFAAN EGINCDVYIP
AGQIAPGKLS QAYQFGAQIL EIDGNFDDAL KQSLDDAQNH DGYTVNSVNP FRIEGQKTIP
FRALEYLNWE VPDWIVYPGG ALGNTSSCGK ALMELYEWGW IKKIPRIAVI NSEGASTLSD
LYNGKFEGEE LRWNKGNPNT ELITRYYDDL DSKGIRPKTK ATAIQIGRPA NILKGLRALE
FTNGVATTVS DSEMLDGMSV VGLNGFDCEM ASGASVVGVK KLTSEGIIQK DDTVVGILTG
RQKDAMLPVD YHNNPQNKFA KPPKN
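
The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch, the helpers below strip that layout and compute a GC percentage of the kind reported in the GC content field. The names `clean_sequence` and `gc_percent` are hypothetical, and how the database itself computes or rounds GC content is not stated in this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only to show the cleanup step.
first_line = "ATGCAGGGAG ACGCATATCT CAAGTGTATT GATCCACAAT GCGGTATGGA ATACCCTATT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

To check the whole gene, paste every line of the gene sequence into one string, pass it through `clean_sequence`, and compare `gc_percent` of the result with the GC content field.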