Gene Nmul_A1120 details

Gene Information

Locus tag: Nmul_A1120
Symbol:
ID: 3785700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1291624
End bp: 1292820
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 54%
IMG OID: 637811205
Product: peptidase M23B
Protein accession: YP_411815
Protein GI: 82702249
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG4942] Membrane-bound metallopeptidase
TIGRFAM ID:
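
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1291624
end_bp = 1292820

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1197 398
```

Both results agree with the Gene Length and Protein Length fields.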


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGCAAG CACCTGTGAC CGCAGAGCCA TCCGATAGTG AAGAGTTGAA ACAACTGAGA 
AACAAGATCG AGACGCTGGA AAAGGAACTC ACGGATACGG AGGGATACAG ATCCGAAGCG
GCGGGAGCGC TGCGCGAATC GGAAAAAGCG ATCGATGTCG CTAACCGGAG GCTTGCGGAA
CTCGCCAAGC AGCGGCGTGC CGCGAACAGC AAGCTTGGTC AACTGAAAGC GCAGTCGGCC
CAGATCAAGA AGGAAATAGC TGCGCAGCAA CTGCAGTTGA GGAACCTGCT TTACGGGCTC
TATATCGCCG GAGGCGGACG AAAAGAGTAC TTGAGTCTCT TGTTGAGCCA GCGGAATCCT
AATGAAATAG CCCGCAATCT CCACTATTAC GAATATTTCT CACGCGCTCG CAAGGAAGGT
ATCGATGGCC TGCGTGCGAA TCTTGAGAAG CTCAATACGC TCAGCCTCGC CAGCCGGGAA
AAAAGCGCGG AGATCTCGGG TGTGCATGCA CGCCACGCTG AGCAGAAAAT ACAGCTCGAA
CAACAAAAAG ACAGCCATGC GAAGCTTCTG GCAGAAATAT CACTGCAAGC CGAAGAGCAG
CGGCGCGAAA TCAACCGCCT CAGGCGTAAT GAAGACCGCC TCACCAGGCT GGTGGAAAAA
CTTACCAGGA TGCTTGCGAA GAAAAAGAAA TTCGAATCAT CTGAAAAACC TTCCGAGCCA
GGCGCACCCC CTACTCCTGC CAGCAACAGC GGTTCCCCCG ATTTATCTGA GAATGCGACG
CCATTCACGA ATGGAACGTC CTTCTCCTCC CTTCGCGGCC ACCTGAATTC ACCGGTGCGC
GGGGAACTTG CAAACCGCTT CGGCAGTCCA CGTGCAGATG GTGGCGTTAC CTGGAAAGGG
TTGTTCATCC GCGCCGCTGG CGGCGAGAGT GTGAAGGCGA TTGCAAATGG ACGCGTCGTA
TTCGCCGACT GGCTCAGAGG GTTCGGCAAC CTGATGATTC TGGACCACGG CGACAACTAT
ATGAGTCTTT ATGGAAATAA TGAAGCAGTC CATAAGCGGG TGGGAGACGT GATCAATGCC
GGAGAGACGA TAGCCACAGT AGGCAATAGC AGTGGAAATT CCGATACCGG CCTATACTTC
GAATTGCGCC ATCAGGGCAA ACCGTTTGAC CCTTTGAACT GGGTCAGAAT AAAATGA
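
The gene sequence is printed in blocks of 10 bases, 60 per row, so it has to be stripped of whitespace before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used here for brevity, so the result applies to that row and not to the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied as printed (blocks of 10 bases).
first_row = "ATGTGGCAAG CACCTGTGAC CGCAGAGCCA TCCGATAGTG AAGAGTTGAA ACAACTGAGA"

seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full 20-row sequence to the same two functions gives the length and GC fraction of the whole gene.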
 
Protein sequence
MWQAPVTAEP SDSEELKQLR NKIETLEKEL TDTEGYRSEA AGALRESEKA IDVANRRLAE 
LAKQRRAANS KLGQLKAQSA QIKKEIAAQQ LQLRNLLYGL YIAGGGRKEY LSLLLSQRNP
NEIARNLHYY EYFSRARKEG IDGLRANLEK LNTLSLASRE KSAEISGVHA RHAEQKIQLE
QQKDSHAKLL AEISLQAEEQ RREINRLRRN EDRLTRLVEK LTRMLAKKKK FESSEKPSEP
GAPPTPASNS GSPDLSENAT PFTNGTSFSS LRGHLNSPVR GELANRFGSP RADGGVTWKG
LFIRAAGGES VKAIANGRVV FADWLRGFGN LMILDHGDNY MSLYGNNEAV HKRVGDVINA
GETIATVGNS SGNSDTGLYF ELRHQGKPFD PLNWVRIK
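
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of translation table 11, which match the standard genetic code for all 64 codons (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 27 bases of the gene are used.

```python
from itertools import product

# Codons in TCAG order (third base varies fastest) paired with their amino acids.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence, copied as printed.
gene_start = "ATGTGGCAAG CACCTGTGAC CGCAGAGCCA"
gene_end = "CCTTTGAACT GGGTCAGAAT AAAATGA"

print(translate(gene_start))  # MWQAPVTAEP
print(translate(gene_end))    # PLNWVRIK
```

The two outputs match the first ten and last eight residues of the protein sequence above, and the final TGA codon is read as a stop.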