Gene Nmar_0911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0911 
Symbol 
ID5774454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp794959 
End bp796308 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content35% 
IMG OID641316550 
Productanthranilate synthase 
Protein accessionYP_001582245 
Protein GI161528419 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACCT TTGGAAAGAC TCAAGCAAAA GTAATACCCC TAGATTTATC TGAAAACCAG 
TTTCAAATTT ACAATAAAAT TTCAAGAAAT TATTCACATT CATTTCTATT TGAATCACTT
ACCGGTCCCG AAGTTTTAGC TGAAACATCA GTGATGGGTT TTGACCCAAA AGTAATTCTC
AAGGGATATT CAGACAAGGT AGAGATAATT CAAGAAGGCA AAACAAAGAC CATACAAACT
AGCGACCCAT TTTCAGAATT AAAAAAACTA CTTGGAAAAT CAGATGATCA AAGTTACAGA
TATCTCGGAG GGGCAGTAGG AGTTGTAAAT TATGATGCAA TTAGAATGGT AGAAGACATT
CCAGATACTC ATAATTCACC TCAACCATTA ATGGAATTTG GAATTTATGA TGATGGGTTG
TTATATGACA ATGTACACAA AAAATTATTT TATTTCTATC ATGATGAAAA CAGATTTGAA
AAATTAGTTA TGAGTGATGA TGAGTTTGAA GAATTTCATT CAAGTGAGGT CACACCAAAC
ATGGACAAGA CAAAATTTTC AGAAATCGTA AACAAAGCAA AAGAGTACAT TCATGATGGG
GACATATTCC AAGTAGTGCT ATCACGCAAG TTTGCATTTG ACACATCTGG AGATAATCTT
ACATTATACA AAACACTAAG AAAATTAAAT CCATCACCAT ACATGTATCA TCTAAAACAA
GAAAATAAAA CAATAATTGG TGCATCTCCA GAAATGTTAG TTAGAATTAC AGATGATAAA
GTAGAAACAT TCCCCATTGC AGGAACTAGA AAAATTACAG ATGATGAAGA GAAAAATAAG
CAACTAGCTG AAGAGTTAAT CCATGATGAA AAAGAGTTAG CAGAACACAC AATGCTTGTA
GATTTGGGAA GAAATGATAT TGGACGAGTT TGCAAATATG GAACAGTTCA TCCAGAATCA
TTAATGGAGA TAAAGAGATT CAGTCATGTC CAACATATTG TCAGCCATGT TGTAGGCAAT
TTGGCGCCTG AAAATGACAT GTTTGATGCA TTTCAGGCAG TATTTCCAGC AGGAACAGTA
TCAGGGGCAC CCAAGGTAAG AGCCATGGAG ATTATAGATG AGTTAGAGAC TGAGTCCAGA
GGCCCATATG CAGGTGCAGT CGGATATTTT TCATACAATG GATGTTGTGA TTTTGCTATT
GCAATTAGAA GCATATTCAT CGAAGACGGA AAGGGATTTG TCCAATCAGG TGCAGGAATT
GTTTCTGATT CAATCCCAGA AAATGAATTC AAAGAGACAG AGCACAAAGC AGGTGCAATG
CTGCAAGCAT TAAAGGAGGC ATCATCATGA
 
Protein sequence
MDTFGKTQAK VIPLDLSENQ FQIYNKISRN YSHSFLFESL TGPEVLAETS VMGFDPKVIL 
KGYSDKVEII QEGKTKTIQT SDPFSELKKL LGKSDDQSYR YLGGAVGVVN YDAIRMVEDI
PDTHNSPQPL MEFGIYDDGL LYDNVHKKLF YFYHDENRFE KLVMSDDEFE EFHSSEVTPN
MDKTKFSEIV NKAKEYIHDG DIFQVVLSRK FAFDTSGDNL TLYKTLRKLN PSPYMYHLKQ
ENKTIIGASP EMLVRITDDK VETFPIAGTR KITDDEEKNK QLAEELIHDE KELAEHTMLV
DLGRNDIGRV CKYGTVHPES LMEIKRFSHV QHIVSHVVGN LAPENDMFDA FQAVFPAGTV
SGAPKVRAME IIDELETESR GPYAGAVGYF SYNGCCDFAI AIRSIFIEDG KGFVQSGAGI
VSDSIPENEF KETEHKAGAM LQALKEASS