Gene Nmul_A1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1854 
Symbol 
ID3786596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2137370 
End bp2138278 
Gene Length909 bp 
Protein Length302 aa 
Translation table11 
GC content54% 
IMG OID637811939 
ProductN5-glutamine S-adenosyl-L-methionine-dependent methyltransferase 
Protein accessionYP_412541 
Protein GI82702975 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03533] protein-(glutamine-N5) methyltransferase, ribosomal protein L3-specific
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCATTG AAGCCAAAAA CCAGCTCCAG ACCATCCGCG ACGTACTGCG TTTCGCCATA 
AGCCGCTTTA ACGATGCAGG GCTTCATTTT GGTCATGGCT CTGCCTCGGC GTATGACGAA
GCAGCTTATC TCATTCTCCA CACGCTGCAT CTTCCACTTG ACCGGCTGGA ACCGTTTCTG
GATGCCCGCA TCCTTCCGGG CGAGCTGGAA CTGGTCCTGA AAATAATCGA GCGCCGTGCA
ACAGAAAAGA TTCCGGCTGC CTATCTGACG AGAGAAGCCT GGCTGGGGGA TTTCCATTTT
TATGTGGATG AACGCGTGAT CGTACCGCGC TCCTTCATTG CGGAATTGCT GCGAGAACAA
CTTGCGCCCT GGATGGAAGA GCCGGCAGAA GTTTATTCCG CATTGGATCT TTGCACCGGC
TCCGGATGTC TGGCGATACT GCTCGCACAT GCGTTTCCCA ATGCAGCCAT CGATGCGACA
GATATTTCAG CGAATGCCTT GCAGGTTGCT GAAAAAAATG TGGAGGAGTA TGGCCTGGAG
GACCGGATCG ATCTTATCCA GTCGGATCTA TTCGCGGCAT TGGCAGACCG CCGCTACGAT
CTTATTGTCA GCAATCCGCC CTATGTCAAC GCGGAAGCAA TGGCAGCGTT GCCGGAGGAA
TATCGCCATG AGCCGCAGAG TGCGCTTGCC AGCGGCGAGG ATGGACTGAA GGCGACAAAG
GTAATACTAC GGGACGCAGC AAACCATTTG ACCGCTGATG GGTTACTCAT TGTCGAAATC
GGTCATAACA GGGAAGCCCT GGAGCGTGCC TTTCCCGATA CACCTTTTAC CTGGCTGGAC
ACCAGTGCAG GCGATGAGTT TGTTTTCCTG CTGAAACGGG ACCAGCTTCC CAGGCATCAG
GCTTTGTAA
 
Protein sequence
MFIEAKNQLQ TIRDVLRFAI SRFNDAGLHF GHGSASAYDE AAYLILHTLH LPLDRLEPFL 
DARILPGELE LVLKIIERRA TEKIPAAYLT REAWLGDFHF YVDERVIVPR SFIAELLREQ
LAPWMEEPAE VYSALDLCTG SGCLAILLAH AFPNAAIDAT DISANALQVA EKNVEEYGLE
DRIDLIQSDL FAALADRRYD LIVSNPPYVN AEAMAALPEE YRHEPQSALA SGEDGLKATK
VILRDAANHL TADGLLIVEI GHNREALERA FPDTPFTWLD TSAGDEFVFL LKRDQLPRHQ
AL