Gene Information |
Locus tag | Nmar_0547 |
Symbol | |
ID | 5773890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 486886 |
End bp | 487983 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316180 |
Product | chorismate synthase |
Protein accession | YP_001581881 |
Protein GI | 161528055 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
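The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length; the record itself does not state either convention. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 486886
end_bp = 487983
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
coded_residues = codons - 1

print(span_bp, codons, remainder, coded_residues)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.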
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTGTCAG GAAGTTCTAT TGGTCAGCGC CTTGTGTTGA CAAGTTTTGG AGAGAGTCAT GGAAAATCTA TTGGGGCAGT TTTAGACGGA TGTCCTGCAG GATTAGAGAT TGATGAAAAA GATATTCAAA AAATGTTAGA CCAAAGAAAA CCAGGTCAAA ATCTAATTTC AACACAAAGA AAAGAAGGAG ACGTTGTAGA GATTATCTCA GGAGTGTTTA GAGGACATAC AACCGGAGCT CCAATAACAA TGGTAATTTG GAATAGTGAT CAAAAATCAA AAGATTATGA AAATTTGAAA ACAAAACTCA GACCAGGACA TTCAGACTAT CCTGCTATGA TGAAATATAA TCAATATAAT GACCACCGAG GAGGAGGACG ATTTTCAGGA AGATTAACTG CTACACATGT AATGGGCGGT GCGATTGCAC GTAAACTTCT CAAAGTTACA TTAGGTATTG AAACAAATTC TTACACATCT CAAATTGGAA AAATAAAGAT GGAAAGACAA TTCAATGAAA AAATGATAAG TTCAATTTAC AAAAACGAAG TAAGGTGTCC TGAAACAAAA ACTGCAAAAA TGATGAGAGC AAGTATTTTG GATGCAAGAA AAAAAGGAGA TTCATTAGGA GGAATCATTG AATCAATTAC AACAAATGTA CCAGTTGGTT TAGGAGAACC AATTTTTAGT TCACTAGAAT CAGATTTGAG TAAAGCAATG TTTTCTATTC CATCAGTAAA AGGAGTAGAG TTTGGTTCAG GATTCAAAGG TTCAGAGATG TACGGTTCAG AAAATAATGA TTTGTACACC ATAAAAAGAG GAAAAATTGT TACAAAGACA AACAATTCAG GCGGAATATT AGGTGGAATC TCAAATGGCA TGCCCATTAC CATGAGAGTA GCATTCAAGC CCGCATCATC AATTTCACAA AAACAAAGTA CTGTTGACAT CAAGACCAAA AAAGAAACCA CACTTCAAGT GAAAGGAAGA CACGATCCAT GTGTCGTTCC AAGAGCACCA CCAGTAGTTG ATTCTCTTGT AGCACTAACA ATTGCAGATC ATGCACTAAT TTCGGGTCAA ATCAAGCCTG TTTTGTAA
Protein sequence | MLSGSSIGQR LVLTSFGESH GKSIGAVLDG CPAGLEIDEK DIQKMLDQRK PGQNLISTQR KEGDVVEIIS GVFRGHTTGA PITMVIWNSD QKSKDYENLK TKLRPGHSDY PAMMKYNQYN DHRGGGRFSG RLTATHVMGG AIARKLLKVT LGIETNSYTS QIGKIKMERQ FNEKMISSIY KNEVRCPETK TAKMMRASIL DARKKGDSLG GIIESITTNV PVGLGEPIFS SLESDLSKAM FSIPSVKGVE FGSGFKGSEM YGSENNDLYT IKRGKIVTKT NNSGGILGGI SNGMPITMRV AFKPASSISQ KQSTVDIKTK KETTLQVKGR HDPCVVPRAP PVVDSLVALT IADHALISGQ IKPVL
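The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be joined into one string and its GC percentage computed follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content value was derived, so this is only one plausible calculation. The example uses just the first two blocks of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops all whitespace between blocks.
    return "".join(grouped.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two blocks of the gene sequence, used here as a short example.
fragment = clean_sequence("ATGTTGTCAG GAAGTTCTAT")
fragment_gc = gc_percent(fragment)
print(len(fragment), fragment_gc)
```

Passing the full gene sequence field to `clean_sequence` and then to `gc_percent` would give a value to compare against the listed GC content.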