Gene Nmul_A1798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1798 
Symbol 
ID3786349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2054989 
End bp2056896 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content58% 
IMG OID637811884 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_412487 
Protein GI82702921 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAA AGGATAAAAA GACCCAACTT CCCATGGACA AAAAAACCCA GATCAACTTC 
TGGTACGTCA TTATCGCGGT ACTCGGCATT CTGCTCATAC AAAGCATGTA TGCCCGCTAT
ACCAAGGTCG AGCCCATTCC TTATAGCCGC TTTCATACCC TGCTGGACGA GGACAAGATT
GCGGAGATAG CCATCACCGA AAATCATATT TATGGCACCC TGAAGGGGGA AGGTGCCGAC
GGGTTGAAGG ATTTCGTGAC CACGCGGGTG GAACCCGAGC TGGCTGATAA GCTCGATCAA
CATCACGTGA CGTATACGGG CGTGGTCCAG AGCACATGGA TGCGCGATCT GCTGTCGTGG
CTATTGCCGA TGGCGATTTT TTTCGGGATA TGGCTGTTTA TCATCCGCCG CATGAATCCG
GGCGGCATGA CGGGCGGGCT GATGTCGATC GGCAAGAGCC GCGCCAAAGT TTTTGTGGAG
AAAGAAACCA AGGTAACTTT TGCAGATGTG GCCGGAGTGG ATGAAGCCAA GGAAGAGTTG
GAAGAAGTCA TCAATTTTTT GAAGGATCCT GCCGGATACA GCCGCCTGGG CGGGCGGGTA
CCCAAGGGTA TCCTGCTGGT GGGGCCACCG GGTACGGGCA AGACGCTGCT GGCCCGTGCC
GTGGCAGGCG AAGCGAATGT TCCGTTCTTC TCGATCTCCG GTTCCGAGTT CGTAGAGATG
TTCGTCGGGG TCGGGGCCGC GCGCGTGCGC GACCTGTTTG AACAAGCGCG CCAGATGGCT
CCCGCCATCA TCTTTATCGA TGAGTTGGAT TCGCTTGGAC GAGCCCGGGG CGCCTATGGG
CTTGGGGGTC ATGACGAGAA GGAGCAGACG CTGAATCAAC TGCTGGCCGA GCTTGATGGT
TTCGACCCTA AAAGTGGCGT GGTGCTGCTC GCGGCGACCA ACCGGCCCGA AATCCTCGAC
CCCGCTCTGC TGCGGGCGGG GCGATTTGAC CGGCAGGTAC TGGTGGATCG GCCGGACAAA
GTGGGGCGAG AACAGATTCT TGCCGTACAT CTGAAGAAGG TGAAGCTCGA TCCTGACGTA
AAAAAAGAGC AGATTGCAGC TTTGACGCCG GGGTTTACGG GCGCCGATCT CGCCAACCTG
GTCAACGAAG CTGCCCTGCT AGCCACCCGC AGGAATGGCG CGGCCGTGAC GATGGGGGAT
TTCAATAACG CCATTTTGCG GGTGGTAGCC GGCCTTGAGA AGCGCAACCG CCTGCTCAAT
CCGGCAGAGC GCCGGGTGGT CGCGTTTCAT GAACTGGGAC ACGCGATGGT GGCACTCGCC
TTGCCTGGAA CCGACGCGGT ACACAAGGTT TCTATTATCC CGCGTGGAAT AGGCGCGCTC
GGCTATACAG TGCAGCGGCC GACCGAAGAC CGGTTTCTGA TGACCCGGGC AGAGCTGGAA
AACAAAATGG CGGTGATGAT GGGCGGACGA GCAGCCGAAC GTGTGGTATT CAACGAAATA
TCGACTGGCG CATCGGATGA CATCGTCCGC GCGACCGACC TTGCCCGTGC GATGGTGCTC
CGTTATGGAA TGACCGAGGC CCTCGGGAAT GTTGCGTATG ACCGCGAACG TTCGCAATTT
CTGCAACCCG GCATTCCCAT GCCGCAAAGC CGGGACTATA GCGAAGAGAC GGCAAACACG
GTTGACAGCA CTGTGCGTGC GCTCGTCGAT GGTGCATTGA AGCGGGCGAT AGAGATATTG
GAAAACAACC GCGCGCTGCT CGACCGGACG GCGGAGGAGT TGCTTCGGGT CGAAACGTTG
AATGAACCCG AGATAGAGAA CCTGAAACGG CAGATCACCG CCAGGCCCGT GCTTCCTCGC
ACCGACACGC CCCTCCCGGA AAAGGACAAA GCCCTGGAGA AAGCGTAG
 
Protein sequence
MDKKDKKTQL PMDKKTQINF WYVIIAVLGI LLIQSMYARY TKVEPIPYSR FHTLLDEDKI 
AEIAITENHI YGTLKGEGAD GLKDFVTTRV EPELADKLDQ HHVTYTGVVQ STWMRDLLSW
LLPMAIFFGI WLFIIRRMNP GGMTGGLMSI GKSRAKVFVE KETKVTFADV AGVDEAKEEL
EEVINFLKDP AGYSRLGGRV PKGILLVGPP GTGKTLLARA VAGEANVPFF SISGSEFVEM
FVGVGAARVR DLFEQARQMA PAIIFIDELD SLGRARGAYG LGGHDEKEQT LNQLLAELDG
FDPKSGVVLL AATNRPEILD PALLRAGRFD RQVLVDRPDK VGREQILAVH LKKVKLDPDV
KKEQIAALTP GFTGADLANL VNEAALLATR RNGAAVTMGD FNNAILRVVA GLEKRNRLLN
PAERRVVAFH ELGHAMVALA LPGTDAVHKV SIIPRGIGAL GYTVQRPTED RFLMTRAELE
NKMAVMMGGR AAERVVFNEI STGASDDIVR ATDLARAMVL RYGMTEALGN VAYDRERSQF
LQPGIPMPQS RDYSEETANT VDSTVRALVD GALKRAIEIL ENNRALLDRT AEELLRVETL
NEPEIENLKR QITARPVLPR TDTPLPEKDK ALEKA