Gene Nmul_A0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0036 
Symbol 
ID3784025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp35230 
End bp36420 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content56% 
IMG OID637810105 
ProductS-adenosylmethionine synthetase 
Protein accessionYP_410737 
Protein GI82701171 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0192] S-adenosylmethionine synthetase 
TIGRFAM ID[TIGR01034] S-adenosylmethionine synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAC AGAATTTCAT GTTCACCTCC GAATCTGTAA CCGAAGGGCA TCCGGATAAG 
CTTTGCGATC AAGTGAGCGA TGCGTTGGTC GACCGCTTTC TGCAGCAAGA CCCCTTTTCG
AAGGTGATAG CGGAGTGCGC AGTATCCACG GGAATACTTT TTATCGCAGC CCGCTTCGCT
TCGTCGGCTT CTGTCGACAT CCCCGAGGTT GCGCGCCGGG TTATACACCA GCTTGGCTAC
GAGCAAGCGG ATTTCAATGC AAGAGATTGC ACGATCATGA CGAGCCTGAG CGAACTGCCG
GCACCGTCTT ATCCGTTCAT AGATGAGAGG GAAATGAATG AGGAAGAGCT GGAATCCGTT
ACTGCCAAGA ACCAGGCCAA TGTATTCGGC TTTGCCTGTA ATCAGAGCTC GGCCTTTATG
CCGCTGCCCA TCTCGCTTTC GCATAAACTG GCGCGTCGGC TTACGGCCGC ACGTTTTCAG
AAGCAGATCC CCTATCTTGC GCCCGATGGA AAAACGCAGG TAGGGGTGGA ATATCGCGAA
GGCAGACCGT ATCGCATTCA TTCCATCACC ATCATTGCCG CTCAGCAGAA GACCGCCATG
CGCGGCCTCG CACCGCTGCG CGATGATCTC AACGCGCATG TGATTGAGCC GGTTTTCGCA
ACGGAGACGC TGCGGCCCGA CTCCCGCACC CATATTTTCA TCAATCCGGA AGGGCTGGTC
GCCGACGGAG GACCGTCGCT ACATTCCGGC TTGACTGGCC GCAAGAACGC AGTGGATACA
TACGGTGAAT ATTCCCGGCA TTCGGAATCC GCGCTTTCCG GCAAGGACCC TTCTCGTATC
GATCGCGTGG GAGCGTATGC AGCGCGCTAC GCGGCCAAGA ATGTAGTCGC CGCCGGCCTC
GCCCAGGCAT GCGAGGTACA TCTGGCGTAT TCGATAGGAA TATCGAGGCC GGTAAGCGTG
CAGGCGGATA CTTTTGGTAC AGGGAGCGTT CCTGATACCG AAATAACGGC GCGTATTCTC
GACCAGTTTG ATTTCCGCCC TGCCGGGATC ATCCGCGCCT TTAATTTGCG CTATCAGCCT
CAACTATTTC GCGGTTTGTT TTACCGCAAG CTTGCGGTTT ACGGGCAGGT GGGACGGATG
GATATCGGAT TGCCGTGGGA GAACACGGAC AAGGCTGCAT TGCTGCGCTA G
 
Protein sequence
MMKQNFMFTS ESVTEGHPDK LCDQVSDALV DRFLQQDPFS KVIAECAVST GILFIAARFA 
SSASVDIPEV ARRVIHQLGY EQADFNARDC TIMTSLSELP APSYPFIDER EMNEEELESV
TAKNQANVFG FACNQSSAFM PLPISLSHKL ARRLTAARFQ KQIPYLAPDG KTQVGVEYRE
GRPYRIHSIT IIAAQQKTAM RGLAPLRDDL NAHVIEPVFA TETLRPDSRT HIFINPEGLV
ADGGPSLHSG LTGRKNAVDT YGEYSRHSES ALSGKDPSRI DRVGAYAARY AAKNVVAAGL
AQACEVHLAY SIGISRPVSV QADTFGTGSV PDTEITARIL DQFDFRPAGI IRAFNLRYQP
QLFRGLFYRK LAVYGQVGRM DIGLPWENTD KAALLR