Gene Nmar_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1099 
Symbol 
ID5774109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1002328 
End bp1003491 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content37% 
IMG OID641316741 
Product2-methylcitrate synthase/citrate synthase II 
Protein accessionYP_001582433 
Protein GI161528607 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCA AAAATATTGG CTTGCGAGGA ATTGAAGTCG CAGATACAAG AATTTCAAAT 
ATTGACGGTG AAAAAGGCAA ATTAATCTAC AGAGGATTTG ACATTTTAGA TCTTACAAAA
AATTCAACAT TTGAAGAGAC AGCATATTTG CTGCTTTATG ACAATCTACC AACAAAGGCA
CAGTTAGATG AATTTAATCA AAAACTAGTT GAAGCACGAT ACATCCCAAA ACAGATGCAA
AAGAATATGG GAAATTGGAG AAAGGATGCA GATCCAATGG ACATGCTTCA GGCATTTGTC
TCAGCACTGG CAGGATACTA CGATGAGGAA TTTTCAAACA AGGAAGCAAG TTATGAAAAA
GCAATCAATC TAATTGCCAA AGTTCCAACA ATAATTGCAA GTTGGCAACG AATTAGAAAT
GGACTACCAA TTGTAGACCC AGATTCATCT CTTAGTCATG CAGCAAATTT CCTTTACATG
ATGTCAGGAG AAAAACCAGA TCCTGAAGTA GAGAAGGTTT TTGACGTGTG TCTAATTTTG
CATGCAGACC ACACATTCAA TGCATCTACA TTTACAGCTA GACAAGTTGC ATCAACAAGA
GCTCACATGT ATTCAGCATC AAGTGCAGCA ATTGGTGCAC TAAGTGGAGA ACTACACGGA
GGGGCAAACA CCGAAGTCAT GAAGATGTTA CTAGAAATCA AAGAAATTGA CAAAGTAGAA
CCATGGATTA AAGAAAAAAT GAGTGCAGGT GAAAGAATTA TGGGAATGGG TCATGCAGTT
TACAGAACAT ATGATCCTAG AGCACAAGTT CTAAAAGAGC TCTCAAGAAA ACTTGCAGAA
AAAACAAAAG AGCCATGGTT TGATATGACT GAAAAAGTTG AAACTACAAC CATTTCTGAA
ATGAAAGCAC AGAAAGGAAA AGACATCTAT CCAAATGTCG ATTTGTATAG TGCATCAATT
TACTATATGT TAAAAATTCC AGTAGATTTG AACACACCAA TATTTGCAAT ATCAAGAGTC
GTAGGATGGG CAGCTCATAT TATTGAAGAA AAGTTTGCAG AAGCTGCACC AAAACCAGCA
TTGTATAGAC CAAAAGCAAC GTATGTTGGA AAGTATTGTG GTCCAGAAGG TTGTGAATAC
AAAACACTAG ACTTGAGAAA ATAA
 
Protein sequence
METKNIGLRG IEVADTRISN IDGEKGKLIY RGFDILDLTK NSTFEETAYL LLYDNLPTKA 
QLDEFNQKLV EARYIPKQMQ KNMGNWRKDA DPMDMLQAFV SALAGYYDEE FSNKEASYEK
AINLIAKVPT IIASWQRIRN GLPIVDPDSS LSHAANFLYM MSGEKPDPEV EKVFDVCLIL
HADHTFNAST FTARQVASTR AHMYSASSAA IGALSGELHG GANTEVMKML LEIKEIDKVE
PWIKEKMSAG ERIMGMGHAV YRTYDPRAQV LKELSRKLAE KTKEPWFDMT EKVETTTISE
MKAQKGKDIY PNVDLYSASI YYMLKIPVDL NTPIFAISRV VGWAAHIIEE KFAEAAPKPA
LYRPKATYVG KYCGPEGCEY KTLDLRK