Gene Nmul_A2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2541 
Symbol 
ID3784046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2909986 
End bp2911185 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content56% 
IMG OID637812632 
ProductS-adenosylmethionine synthetase 
Protein accessionYP_413222 
Protein GI82703656 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0192] S-adenosylmethionine synthetase 
TIGRFAM ID[TIGR01034] S-adenosylmethionine synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.245618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAT ATCTTTTTAC CTCCGAATCC GTGTCTGAAG GTCATCCCGA TAAGGTGGCC 
GATCAAATTT CCGATTCCAT CCTCGACGCC ATCCTGGCTC AGGATCCCAA TGCACGGGTA
GCTTGCGAAA CGTTATGCAG TACCGGTCTG ATCGTCATGT CGGGCGAAAT CACCACTCAG
GCCAATGTGG ACTACATGCA GGTCGCGCGC GCCGCGGTCA AGCGCATCGG TTACAACAGT
TCCGATATCG GCTTTGACTA TAATACCTGT GCAGTGCTGA CTGCTTTCAA TAAGCAATCC
CCTGATATCG CGCGCGGGGT CAACCGCACC AAGGAAGAGG AAATGGATCA GGGGGCGGGT
GACCAGGGTC TCATGTTCGG CTACGCCTGC GATGAGACGC CGCAATTGAT GCCCATGCCG
ATCTACTATG CCCACCGGTT GATGGAGCGC CAGGCGGAAC TGCGGAAAGA CGGACGCCTC
CCGTGGCTGC GTCCGGATGC AAAATCGCAG GTTTCAGTAC GTTACCTGGA CGGCAAGCCG
CAGCGAATCG AGACCGTTGT CATTTCCACA CAACATCACC CGGACATCAG TCACACCGAT
TTGAGCGAGG CCATTATTGA GGAAGTCATC AAACCCGTGC TTCCGAAAAA AATGCTGAGT
GGCGAAACGC GGTATCTGAT CAACCCCACA GGACGTTTCG TAGTGGGCGG CCCCATGGGA
GATTGCGGAC TGACCGGACG GAAGATTATC GTCGACAGCT ATGGAGGTAC CGCGCACCAT
GGCGGGGGCG CCTTCTCCGG CAAGGATCCG TCCAAAGTAG ATCGTTCCGC TGCTTATGCC
GGACGATATG TGGCAAAGAA TCTTGTTGCG GCCGGTATCG CCAGTAGATG CGAGGTACAG
ATGGCTTATG CGATCGGAGT TGCCCGCCCC GTTTCGCTGA TGGTGGACAC TTTTGGAACC
GGAAAAATTC CGGACGATAA AATCGTCAAA CTGATTGAGC GGCACTTCGA TCTGCGTCCG
CGCGGCATCA TCCACGGCCT TGATCTGCTG CGTCCCATCT ACGAGAAGAC AGCTGCCTAT
GGCCATTTTG GCCGGGACGA ACCGGAATTC AGCTGGGAAT CGACGGATAA GGCTGCACAG
TTGCGCGAGG AAGCGGGAAT TGAACCCGCG GAAACCGAAC CGCTCAGTCT TCAGGCTTAA
 
Protein sequence
MSEYLFTSES VSEGHPDKVA DQISDSILDA ILAQDPNARV ACETLCSTGL IVMSGEITTQ 
ANVDYMQVAR AAVKRIGYNS SDIGFDYNTC AVLTAFNKQS PDIARGVNRT KEEEMDQGAG
DQGLMFGYAC DETPQLMPMP IYYAHRLMER QAELRKDGRL PWLRPDAKSQ VSVRYLDGKP
QRIETVVIST QHHPDISHTD LSEAIIEEVI KPVLPKKMLS GETRYLINPT GRFVVGGPMG
DCGLTGRKII VDSYGGTAHH GGGAFSGKDP SKVDRSAAYA GRYVAKNLVA AGIASRCEVQ
MAYAIGVARP VSLMVDTFGT GKIPDDKIVK LIERHFDLRP RGIIHGLDLL RPIYEKTAAY
GHFGRDEPEF SWESTDKAAQ LREEAGIEPA ETEPLSLQA