Gene Information |
Locus tag | Nmar_0548 |
Symbol | |
ID | 5773778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 488310 |
End bp | 489578 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641316181 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001581882 |
Protein GI | 161528056 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
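The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `span_length` and `coding_codons` are illustrative and not part of the database.

```python
# Field values copied from the Gene Information table above.
START_BP = 488310
END_BP = 489578
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)     # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the record is internally consistent: 489578 - 488310 + 1 = 1269 bp, and 1269 / 3 - 1 = 422 aa.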
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTGTA AAGTAGAAAA ATCAAAAATT TCAGGACAAA TTGTTTGTCC TTCAAACAAG AGCTATACTC ATAGAGCAAT ATTTCTTGCA TCACTTGCAG GAAATGGCAG CAAGGTGGAA AATGTACTAT TATCAGCAGA CACCATGGCA ACAGTTGAAG CATGTAAAAA ATTTGGAGCA TCCATTGAGA TTGAAAATTC ATCAATAATT GTAAAAAATC CCATAAAATT TGACAAAATC GTGCCTGAAA TCAATACTGA AAATTCAGGA ACCACAATAA GAATAGCCTC AGGAATCGCT AGTTTGTTTT CAGAAGAGAT TACGTTAACA GGGGATGAGA GTCTTCAAAA AAGACCCATG CAGCCTCTCT TAGACGCACT ATCAAGTATT GGAGCACAAT GCCAATCAAC TGATGGAAAA CCACCAATCA AAATTACAGG AAAGATTTCA GGTGGAGATG TTACAATTCC AGGAAACTTT TCTAGTCAAT TCATTTCTGC ATTATTAATC AGTGCGCCAT TGACTGAAAA GGGAATCAAT CTTTCAATTA AAGATAATCT AGTATCAAAA CCATATCTTG ATGCCACCAT TGCAACTATG AGAAAGTTTG GAGTAAGCGT ACAAACATTA ATTCCATATA AAAGATACAA CATTTCACCT CAAGTTTACA ATGCGGCAAC ATTTACAGTT CCAATTGATT TTTCTAGTCT TGCATTATTG TTATCAGCAG CAGTACTTAA TGGAGATGAA ACTGTAATCA AAGGAAATAT TGGAAATTTA CCACAAGGGG ATGAAGTCTT TATTGACATA CTAGAGCAAT TAGGAGTAAC TGTAAATATT GGAGAAGATG AAATTAAAAT CAAATCTCCT GAAAAACTAA AAGGAGGAAG ATTTGATTTG AGTAATTCTC CAGATCTTTT ACCACCACTA ACAATACTTG CATTAAATTC AGAAAATCCA ATTGAGATTG TAAATGTAAA ACATGCAAGA CTAAAAGAGA CAGACAGAAT TGCAATAACA TCAAGAGAGT TAGTTAAACT TGGAATTAAA GTTCAAGAAA ATGAAGATGG TTTGATTTTA GAATCAACAG AGAATCTTAC CGGTGCAGAA TTAAATTCTG AAAATGACCA CAGACTATTC ATGGCGTTTT GTATTGCAGG AATGTATGTT GGAAATTGTG TTGTAACAGA TCCTGAATCA GTCCAAGTTT CTTATCCAGA TTTCGTCGAA GAGATGAATA GGATTGGAGC AAGAATTCAA CCAGAATAA
|
Protein sequence | MKCKVEKSKI SGQIVCPSNK SYTHRAIFLA SLAGNGSKVE NVLLSADTMA TVEACKKFGA SIEIENSSII VKNPIKFDKI VPEINTENSG TTIRIASGIA SLFSEEITLT GDESLQKRPM QPLLDALSSI GAQCQSTDGK PPIKITGKIS GGDVTIPGNF SSQFISALLI SAPLTEKGIN LSIKDNLVSK PYLDATIATM RKFGVSVQTL IPYKRYNISP QVYNAATFTV PIDFSSLALL LSAAVLNGDE TVIKGNIGNL PQGDEVFIDI LEQLGVTVNI GEDEIKIKSP EKLKGGRFDL SNSPDLLPPL TILALNSENP IEIVNVKHAR LKETDRIAIT SRELVKLGIK VQENEDGLIL ESTENLTGAE LNSENDHRLF MAFCIAGMYV GNCVVTDPES VQVSYPDFVE EMNRIGARIQ PE
|
| |
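The sequences above are printed in space-separated blocks of ten characters, and the gene sequence as listed begins with ATG and ends with TAA, so it reads in the coding direction even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, estimate GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons).

```python
# Codon table in NCBI ordering (first, second, third base each cycle T, C, A, G).
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = strip_blocks(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    gene_start = "ATGAAGTGTA AAGTAGAAAA ATCAAAAATT"
    print(translate(gene_start))  # MKCKVEKSKI, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and the full protein sequence to `strip_blocks` allows a direct string comparison of the two, and `gc_percent` on the full gene sequence can be compared with the GC content field.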