Gene Nmar_0547 details


Gene Information

Locus tag: Nmar_0547
Symbol:
ID: 5773890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 486886
End bp: 487983
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 36%
IMG OID: 641316180
Product: chorismate synthase
Protein accession: YP_001581881
Protein GI: 161528055
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
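
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (neither assumption is stated on this page). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 486886
end_bp = 487983
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                          # 1098 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 365 True
```

Under those assumptions the reported gene length and protein length agree with the coordinates.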


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTCAG GAAGTTCTAT TGGTCAGCGC CTTGTGTTGA CAAGTTTTGG AGAGAGTCAT 
GGAAAATCTA TTGGGGCAGT TTTAGACGGA TGTCCTGCAG GATTAGAGAT TGATGAAAAA
GATATTCAAA AAATGTTAGA CCAAAGAAAA CCAGGTCAAA ATCTAATTTC AACACAAAGA
AAAGAAGGAG ACGTTGTAGA GATTATCTCA GGAGTGTTTA GAGGACATAC AACCGGAGCT
CCAATAACAA TGGTAATTTG GAATAGTGAT CAAAAATCAA AAGATTATGA AAATTTGAAA
ACAAAACTCA GACCAGGACA TTCAGACTAT CCTGCTATGA TGAAATATAA TCAATATAAT
GACCACCGAG GAGGAGGACG ATTTTCAGGA AGATTAACTG CTACACATGT AATGGGCGGT
GCGATTGCAC GTAAACTTCT CAAAGTTACA TTAGGTATTG AAACAAATTC TTACACATCT
CAAATTGGAA AAATAAAGAT GGAAAGACAA TTCAATGAAA AAATGATAAG TTCAATTTAC
AAAAACGAAG TAAGGTGTCC TGAAACAAAA ACTGCAAAAA TGATGAGAGC AAGTATTTTG
GATGCAAGAA AAAAAGGAGA TTCATTAGGA GGAATCATTG AATCAATTAC AACAAATGTA
CCAGTTGGTT TAGGAGAACC AATTTTTAGT TCACTAGAAT CAGATTTGAG TAAAGCAATG
TTTTCTATTC CATCAGTAAA AGGAGTAGAG TTTGGTTCAG GATTCAAAGG TTCAGAGATG
TACGGTTCAG AAAATAATGA TTTGTACACC ATAAAAAGAG GAAAAATTGT TACAAAGACA
AACAATTCAG GCGGAATATT AGGTGGAATC TCAAATGGCA TGCCCATTAC CATGAGAGTA
GCATTCAAGC CCGCATCATC AATTTCACAA AAACAAAGTA CTGTTGACAT CAAGACCAAA
AAAGAAACCA CACTTCAAGT GAAAGGAAGA CACGATCCAT GTGTCGTTCC AAGAGCACCA
CCAGTAGTTG ATTCTCTTGT AGCACTAACA ATTGCAGATC ATGCACTAAT TTCGGGTCAA
ATCAAGCCTG TTTTGTAA
 
Protein sequence
MLSGSSIGQR LVLTSFGESH GKSIGAVLDG CPAGLEIDEK DIQKMLDQRK PGQNLISTQR 
KEGDVVEIIS GVFRGHTTGA PITMVIWNSD QKSKDYENLK TKLRPGHSDY PAMMKYNQYN
DHRGGGRFSG RLTATHVMGG AIARKLLKVT LGIETNSYTS QIGKIKMERQ FNEKMISSIY
KNEVRCPETK TAKMMRASIL DARKKGDSLG GIIESITTNV PVGLGEPIFS SLESDLSKAM
FSIPSVKGVE FGSGFKGSEM YGSENNDLYT IKRGKIVTKT NNSGGILGGI SNGMPITMRV
AFKPASSISQ KQSTVDIKTK KETTLQVKGR HDPCVVPRAP PVVDSLVALT IADHALISGQ
IKPVL
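
The gene and protein sequences above are printed in groups of ten letters, so any check has to strip that spacing first. The following is a minimal sketch of how the grouped text could be cleaned, translated, and measured for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of this page. It assumes the standard codon assignments, which translation table 11 shares with the standard table (the two differ only in permitted start codons), and it only handles the letters A, C, G and T.

```python
# Hypothetical helpers, not part of the source page.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for ten-letter grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C letters in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first letter; trailing letters are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied as printed.
first_line = "ATGTTGTCAG GAAGTTCTAT TGGTCAGCGC CTTGTGTTGA CAAGTTTTGG AGAGAGTCAT"
last_line = "ATCAAGCCTG TTTTGTAA"

print(translate(first_line))  # MLSGSSIGQRLVLTSFGESH
print(translate(last_line))   # IKPVL*
print(round(gc_fraction(first_line), 2))
```

The two translated fragments match the first twenty and last five residues of the protein sequence, with `*` marking the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information section.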