Gene Nmul_A2736 details


Gene Information

Locus tag: Nmul_A2736
Symbol:
ID: 3785707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 3140727
End bp: 3142037
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 55%
IMG OID: 637812827
Product: peptidase M16-like
Protein accession: YP_413415
Protein GI: 82703849
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
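
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in one uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3140727, 3142037)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the sketch gives 1311 bp and 436 aa, matching the Gene Length and Protein Length fields.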


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCGTC ACCTGTTTTT CCTTTTCCTG GGCTTTTATT CCCAATGGGC GTACGCCACT 
CTGCCAATCC AGCACTGGCA GGCAAACTCG GGTGCCCGCG TTTACTTCAT CGAAAGCCGC
GACTTGCCCA TTCTCGATGT GAGTGTGGAT TTCAGTGCGG GTAGCAGCAC CGACACACCC
GATAAATCAG GCCGCGCGGC GATGGCCCTG CATCTCGTGA ATCTGGGAGC GGGAGGATTG
ACCGAAGACC AGCTCACCAA GGGATTCGCT GATGTCGGCG CGCAGCTGGG CGCCCATTTC
GATCAGGATC GGGCGGGGAT TACGCTCAGA ACATTGAGCA GCGCGCGCGA GCGTGGCCGC
GCACTGGAAT TGTTCGGCAA GGTCATCCAG CACCCCGATT TTCCTGAATA CGTCCTGGGG
CGGGAAAAAG CGCGTGTCAT CGCAGGGCTC AAGGAAGCGG ATACCAAACC CGGCAATATT
GCCGACCGAT CGCTGATGAA AATGCTCTAC GGTACCCATC CTTATGGATT GAGAGGTTCG
GGTGAGATCG AGAGCGTGAG CAAGCTGGGG CGTCAGGATA TGATCGATTT TCACCGGTTT
CGCTATACGG CGGTAGATGC GGTGGTGTCG ATCATGGGAG ATGTAAGCCG TGACGAAGCC
GCAGCGATTG CCGAATCCCT GACCAAGGAT CTGCCACGTG AAAAACGAGG ACAAAGCATC
CCGGCAGTGA CGCCACCCGT CCAGGGCACC CAGCGGATCG CCCATCCGGC TACCCAGAGT
CATATCCAGC TCGCCTATCC CGGGATCAAA CGCGATGATC CCGATTATTT TCCTCTCATT
GTGGGAAATC ACATCCTTGG CGGGGGCGGA TTCACCTCAC GCCTGATGGA GGAAATCCGC
CAGAAACATG GGCTGGCCTA CAGTGTTCAC AGTTCGTTTA CCCCCCTGAA AGAAGAAGGC
CCTTTCGAGA TCGCATTGCA AACCCAGAAA GAACAGTCTG AAGAGGCGCT TTCCATAACC
CGGAAAGTCC TGGCTGATTT TATTGCCGGA GGGCCAACGG AAAAAGAACT GATCGAAGCA
AAAAAAAATA TCATCGGCAG CTTCCCGCTG CGTATCGACA GTAACAAGAA GATACTCGGA
TACCTCGCCA TGATCGGTTT TTACAATCTG CCCCTGACCT ATCTGAATGA TTACGTAAAG
GCGGTATCGA AGGTAACCAT CCCCCAGGTA ACCCAGGCTT TCCAGCGACG CATCAATCCC
TCCGGCATGG TAACCGTGGT GGTGGGTCTT CCGGACACGA GTGGAAAGTA A
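
A GC content figure like the one in the Gene Information section could in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as whitespace-separated blocks, as it appears above, and that GC content means the fraction of bases that are G or C. How the record's own value was computed and rounded is not stated, and the function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, as a short demonstration.
fragment = clean_sequence("ATGCTCCGTC ACCTGTTTTT")
print(f"{gc_content(fragment):.0%}")
```

Passing the full gene sequence to clean_sequence and then gc_content would give the value to compare against the GC content field.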
 
Protein sequence
MLRHLFFLFL GFYSQWAYAT LPIQHWQANS GARVYFIESR DLPILDVSVD FSAGSSTDTP 
DKSGRAAMAL HLVNLGAGGL TEDQLTKGFA DVGAQLGAHF DQDRAGITLR TLSSARERGR
ALELFGKVIQ HPDFPEYVLG REKARVIAGL KEADTKPGNI ADRSLMKMLY GTHPYGLRGS
GEIESVSKLG RQDMIDFHRF RYTAVDAVVS IMGDVSRDEA AAIAESLTKD LPREKRGQSI
PAVTPPVQGT QRIAHPATQS HIQLAYPGIK RDDPDYFPLI VGNHILGGGG FTSRLMEEIR
QKHGLAYSVH SSFTPLKEEG PFEIALQTQK EQSEEALSIT RKVLADFIAG GPTEKELIEA
KKNIIGSFPL RIDSNKKILG YLAMIGFYNL PLTYLNDYVK AVSKVTIPQV TQAFQRRINP
SGMVTVVVGL PDTSGK
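
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It uses the amino acid assignments of NCBI translation table 11, which match the standard code and differ in the permitted start codons. It does not handle alternative start codons; that simplification has no effect here because this gene begins with ATG. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to, and not including, the first stop codon.

    Whitespace between sequence blocks is ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGCTCCGTC ACCTGTTTTT CCTTTTCCTG"))
```

The first 30 bases translate to MLRHLFFLFL, the first block of the protein sequence listed above.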