Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0911 |
Symbol | |
ID | 5774454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 794959 |
End bp | 796308 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316550 |
Product | anthranilate synthase |
Protein accession | YP_001582245 |
Protein GI | 161528419 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACACCT TTGGAAAGAC TCAAGCAAAA GTAATACCCC TAGATTTATC TGAAAACCAG TTTCAAATTT ACAATAAAAT TTCAAGAAAT TATTCACATT CATTTCTATT TGAATCACTT ACCGGTCCCG AAGTTTTAGC TGAAACATCA GTGATGGGTT TTGACCCAAA AGTAATTCTC AAGGGATATT CAGACAAGGT AGAGATAATT CAAGAAGGCA AAACAAAGAC CATACAAACT AGCGACCCAT TTTCAGAATT AAAAAAACTA CTTGGAAAAT CAGATGATCA AAGTTACAGA TATCTCGGAG GGGCAGTAGG AGTTGTAAAT TATGATGCAA TTAGAATGGT AGAAGACATT CCAGATACTC ATAATTCACC TCAACCATTA ATGGAATTTG GAATTTATGA TGATGGGTTG TTATATGACA ATGTACACAA AAAATTATTT TATTTCTATC ATGATGAAAA CAGATTTGAA AAATTAGTTA TGAGTGATGA TGAGTTTGAA GAATTTCATT CAAGTGAGGT CACACCAAAC ATGGACAAGA CAAAATTTTC AGAAATCGTA AACAAAGCAA AAGAGTACAT TCATGATGGG GACATATTCC AAGTAGTGCT ATCACGCAAG TTTGCATTTG ACACATCTGG AGATAATCTT ACATTATACA AAACACTAAG AAAATTAAAT CCATCACCAT ACATGTATCA TCTAAAACAA GAAAATAAAA CAATAATTGG TGCATCTCCA GAAATGTTAG TTAGAATTAC AGATGATAAA GTAGAAACAT TCCCCATTGC AGGAACTAGA AAAATTACAG ATGATGAAGA GAAAAATAAG CAACTAGCTG AAGAGTTAAT CCATGATGAA AAAGAGTTAG CAGAACACAC AATGCTTGTA GATTTGGGAA GAAATGATAT TGGACGAGTT TGCAAATATG GAACAGTTCA TCCAGAATCA TTAATGGAGA TAAAGAGATT CAGTCATGTC CAACATATTG TCAGCCATGT TGTAGGCAAT TTGGCGCCTG AAAATGACAT GTTTGATGCA TTTCAGGCAG TATTTCCAGC AGGAACAGTA TCAGGGGCAC CCAAGGTAAG AGCCATGGAG ATTATAGATG AGTTAGAGAC TGAGTCCAGA GGCCCATATG CAGGTGCAGT CGGATATTTT TCATACAATG GATGTTGTGA TTTTGCTATT GCAATTAGAA GCATATTCAT CGAAGACGGA AAGGGATTTG TCCAATCAGG TGCAGGAATT GTTTCTGATT CAATCCCAGA AAATGAATTC AAAGAGACAG AGCACAAAGC AGGTGCAATG CTGCAAGCAT TAAAGGAGGC ATCATCATGA
|
Protein sequence | MDTFGKTQAK VIPLDLSENQ FQIYNKISRN YSHSFLFESL TGPEVLAETS VMGFDPKVIL KGYSDKVEII QEGKTKTIQT SDPFSELKKL LGKSDDQSYR YLGGAVGVVN YDAIRMVEDI PDTHNSPQPL MEFGIYDDGL LYDNVHKKLF YFYHDENRFE KLVMSDDEFE EFHSSEVTPN MDKTKFSEIV NKAKEYIHDG DIFQVVLSRK FAFDTSGDNL TLYKTLRKLN PSPYMYHLKQ ENKTIIGASP EMLVRITDDK VETFPIAGTR KITDDEEKNK QLAEELIHDE KELAEHTMLV DLGRNDIGRV CKYGTVHPES LMEIKRFSHV QHIVSHVVGN LAPENDMFDA FQAVFPAGTV SGAPKVRAME IIDELETESR GPYAGAVGYF SYNGCCDFAI AIRSIFIEDG KGFVQSGAGI VSDSIPENEF KETEHKAGAM LQALKEASS
|
| |