Gene Nmar_0548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0548 
Symbol 
ID5773778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp488310 
End bp489578 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content34% 
IMG OID641316181 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001581882 
Protein GI161528056 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTGTA AAGTAGAAAA ATCAAAAATT TCAGGACAAA TTGTTTGTCC TTCAAACAAG 
AGCTATACTC ATAGAGCAAT ATTTCTTGCA TCACTTGCAG GAAATGGCAG CAAGGTGGAA
AATGTACTAT TATCAGCAGA CACCATGGCA ACAGTTGAAG CATGTAAAAA ATTTGGAGCA
TCCATTGAGA TTGAAAATTC ATCAATAATT GTAAAAAATC CCATAAAATT TGACAAAATC
GTGCCTGAAA TCAATACTGA AAATTCAGGA ACCACAATAA GAATAGCCTC AGGAATCGCT
AGTTTGTTTT CAGAAGAGAT TACGTTAACA GGGGATGAGA GTCTTCAAAA AAGACCCATG
CAGCCTCTCT TAGACGCACT ATCAAGTATT GGAGCACAAT GCCAATCAAC TGATGGAAAA
CCACCAATCA AAATTACAGG AAAGATTTCA GGTGGAGATG TTACAATTCC AGGAAACTTT
TCTAGTCAAT TCATTTCTGC ATTATTAATC AGTGCGCCAT TGACTGAAAA GGGAATCAAT
CTTTCAATTA AAGATAATCT AGTATCAAAA CCATATCTTG ATGCCACCAT TGCAACTATG
AGAAAGTTTG GAGTAAGCGT ACAAACATTA ATTCCATATA AAAGATACAA CATTTCACCT
CAAGTTTACA ATGCGGCAAC ATTTACAGTT CCAATTGATT TTTCTAGTCT TGCATTATTG
TTATCAGCAG CAGTACTTAA TGGAGATGAA ACTGTAATCA AAGGAAATAT TGGAAATTTA
CCACAAGGGG ATGAAGTCTT TATTGACATA CTAGAGCAAT TAGGAGTAAC TGTAAATATT
GGAGAAGATG AAATTAAAAT CAAATCTCCT GAAAAACTAA AAGGAGGAAG ATTTGATTTG
AGTAATTCTC CAGATCTTTT ACCACCACTA ACAATACTTG CATTAAATTC AGAAAATCCA
ATTGAGATTG TAAATGTAAA ACATGCAAGA CTAAAAGAGA CAGACAGAAT TGCAATAACA
TCAAGAGAGT TAGTTAAACT TGGAATTAAA GTTCAAGAAA ATGAAGATGG TTTGATTTTA
GAATCAACAG AGAATCTTAC CGGTGCAGAA TTAAATTCTG AAAATGACCA CAGACTATTC
ATGGCGTTTT GTATTGCAGG AATGTATGTT GGAAATTGTG TTGTAACAGA TCCTGAATCA
GTCCAAGTTT CTTATCCAGA TTTCGTCGAA GAGATGAATA GGATTGGAGC AAGAATTCAA
CCAGAATAA
 
Protein sequence
MKCKVEKSKI SGQIVCPSNK SYTHRAIFLA SLAGNGSKVE NVLLSADTMA TVEACKKFGA 
SIEIENSSII VKNPIKFDKI VPEINTENSG TTIRIASGIA SLFSEEITLT GDESLQKRPM
QPLLDALSSI GAQCQSTDGK PPIKITGKIS GGDVTIPGNF SSQFISALLI SAPLTEKGIN
LSIKDNLVSK PYLDATIATM RKFGVSVQTL IPYKRYNISP QVYNAATFTV PIDFSSLALL
LSAAVLNGDE TVIKGNIGNL PQGDEVFIDI LEQLGVTVNI GEDEIKIKSP EKLKGGRFDL
SNSPDLLPPL TILALNSENP IEIVNVKHAR LKETDRIAIT SRELVKLGIK VQENEDGLIL
ESTENLTGAE LNSENDHRLF MAFCIAGMYV GNCVVTDPES VQVSYPDFVE EMNRIGARIQ
PE