Gene Nmar_1549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1549 
Symbol 
ID5773380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1418527 
End bp1419741 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content36% 
IMG OID641317201 
Productthreonine synthase 
Protein accessionYP_001582883 
Protein GI161529057 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.738086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTAGAA CTTCGCTGCA ATGTAGGGAA TGTAAAAAGG AGTATGAAAC TGCTTTCAAA 
TACATCTGTG ATGATTGTTT TGGACCTCTA GATGTGAAAT ATGATTTTCC AACAGTTACA
AAAGATACTT TTTCTAATCG TGAACACACA TACTGGAGAT ATTTTGAATT ACTGCCAATT
GAAAACAAAT CTAACATCGT GTCAATTGAT GCAGGACTAA CTCCTCTAAC AAAAGCTGAA
AATCTAGGCA AAGCACTTGA TCTTAACAAT CTCTATATCA AAAATGACTC TGTAAATCCT
ACATTTTCAT TTAAGGACAG ACCTGCTGGA GTTGCAATCT CAAAAGCAAA AGAATTTGGA
TTATCTGCAG TTGGCTGTGC ATCAACAGGT AATTTAGCAT CTGCAACTGC AGCTCATGCT
GCAAAAGGTG GATTCCCATG TCATGTATTT GCTCCAAGTA ATATTGAGAT GGCAAAGATT
GCTCAAGCAT TATCTTATGG CGCAAACTAT GTTGCAGTTG ATGGAACATA TGATGACGCA
AATAGAATTG CAGCTCAAAT TGGTGACTCT AGAGGAATTG GAATTGTAAA TATTAACATG
CGTTCACATT ATGTTGAAGG TTCAAAGACA TTAGCTTATG AAGTTGCAGA GCAATTAGAT
TGGAATGTTC CTGATCAACT TATAGTTCCA GTTGGAAGCG GTGCAATGTT AAATGCTATA
TGTAAAGGAT TTGAAGAACT ACAACAAGTT TCATTACTTG ATGATGTATC TAACATGCAT
ATGATTGCAG CACAACCTCA TGGTTGTGCT CCTGTTGTTG ATGCATTTAA GAAAAATTCT
AAAGATGTAA TTCCAGTTGA GAATCCTGAC ACTGTTGCAA AAAGTCTTGC AATAGGAGAT
CCTGGTGATG GGCGATATGT TCTAAAAAGA TTAGAACAAT ACAATGGATT TGCTGAAGAA
TGTAATAACA AAGAAATTCT TGATGCAATA CTTTTACTAG CAAAGACTGA AGGAATATTT
ACAGAACCTG CAGGTGGAGT ATCTGTTTCA GTATTACAGA AGATGGTAGA ACAAGGAAAG
ATTGACAAAA ATGATAAAGT TGTATGTTAT GTTACTGGAA ACGGACTCAA AGCAACTGAA
TCAATTATGG AAGTGTTAGA AAAACCAAAT GTACTAAAGG CAGACATTTC AGAAGTATCG
GCGGTAGTGA ACTAA
 
Protein sequence
MTRTSLQCRE CKKEYETAFK YICDDCFGPL DVKYDFPTVT KDTFSNREHT YWRYFELLPI 
ENKSNIVSID AGLTPLTKAE NLGKALDLNN LYIKNDSVNP TFSFKDRPAG VAISKAKEFG
LSAVGCASTG NLASATAAHA AKGGFPCHVF APSNIEMAKI AQALSYGANY VAVDGTYDDA
NRIAAQIGDS RGIGIVNINM RSHYVEGSKT LAYEVAEQLD WNVPDQLIVP VGSGAMLNAI
CKGFEELQQV SLLDDVSNMH MIAAQPHGCA PVVDAFKKNS KDVIPVENPD TVAKSLAIGD
PGDGRYVLKR LEQYNGFAEE CNNKEILDAI LLLAKTEGIF TEPAGGVSVS VLQKMVEQGK
IDKNDKVVCY VTGNGLKATE SIMEVLEKPN VLKADISEVS AVVN