Gene Nmul_A0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0853 
Symbol 
ID3784541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp969465 
End bp970760 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content60% 
IMG OID637810935 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_411548 
Protein GI82701982 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACGC AACCAGCGAG CTCTCTTGGA AAGGTACTAG CGGGGCTTCA GAACCTGCTG 
CCGGATCTGG AGGCGTTATA TACCGACGTG CATGCCCATC CGGAGCTGTC GATGCAGGAA
TCGCGTACCG CAAGCCTAGT CGCGGAACGA CTTCGCGCGG CTGGGTATGA CGTGACGACG
AGGGTTGGGA AGACGGGTGT CGTGGGATTG CTGCGCAACG GCGACGGGCC GACCGTCATG
CTGCGTGCCG ACATGGATGC ACTACCCATC GAGGAAATGA CCGGGCTTCC CTATGCCAGC
AAGGTCAAGG CCACGAATCA CGAGGGTAAA ACGGTCCCGG TCATGCACGC TTGCGGCCAT
GACATGCACG TTGCCTGGCT CGTCGGTGCG ACCACGCTGC TCGCGCAGGC GCGTAATACA
TGGGGCGGCA CGTTGATGGC AGTCTTCCAG CCGGCTGAAG AGACTGCGGA AGGCGCCCAA
GCCATGATCG ACGATGGACT GTTCAACCGT TTTCCAATGC CGGATGTCGT GCTTGGCCAA
CACGTCATGG TGGGGCCGGC GGGCAATATT GGCGGCCGTG CCGGATCCAT CACTTCCGCT
GCCGACAGCC TGCAGATCCG CCTGTTTGGG CGTGGGGCGC ACGGATCAAT GCCGCAGGCA
AGCATCGATC CGGTTGTCAT GGCTGCCGCG ACTGTAATGC GCCTGCAGAC CATTGTCTCG
CGTGAACTTG CTGCTGCCGA GGCCGCTGTC GTTACCATTG GCGCGTTGCA GGCGGGCACC
AAGGAAAATG TGATACCCGA CGAGGCGGTC ATCAAGCTGA ATGTACGCAC CTTTGATGCG
GATGTGCGCA AGCGTGTACT TGCCGCCATC GAGCGTATCG CCAATGCAGA GGCTGCAGCT
TCGGGAGCCC CCCGGCCGCC CGAGATTACG ACGCTGGAAC ACTACCCTCT AGGAGTCAAC
GATGCCGATG CAAGCGGCCG CGTCGCCGAT GCTTTCCGTC AATATTTCTC AGCCGACCGC
GTGCGGCAAG TCGATGCGGC GTCGGCGAGC GAGGATTTCG GGTTGTTCGG AACCGAGTGG
GGTGTCCCTT CCGTGTTCTG GTTCGTCGGA GGTACCGATC CCGACCTTTA CGCGAAAGCC
AAGGCCGCAG GTGAAATCAA CAAGATTCCA ACGAACCACA GTCCATACTT TGCACCAGTA
ATGCATCCCA CTCTGGAAAC TGGCGTGGAA ACGATGGTCA TTGGCGCCCT GGCTTGGCTT
CAGCATGAGT CGCAGCACCA GGAGCTGAGA CCATAA
 
Protein sequence
MNTQPASSLG KVLAGLQNLL PDLEALYTDV HAHPELSMQE SRTASLVAER LRAAGYDVTT 
RVGKTGVVGL LRNGDGPTVM LRADMDALPI EEMTGLPYAS KVKATNHEGK TVPVMHACGH
DMHVAWLVGA TTLLAQARNT WGGTLMAVFQ PAEETAEGAQ AMIDDGLFNR FPMPDVVLGQ
HVMVGPAGNI GGRAGSITSA ADSLQIRLFG RGAHGSMPQA SIDPVVMAAA TVMRLQTIVS
RELAAAEAAV VTIGALQAGT KENVIPDEAV IKLNVRTFDA DVRKRVLAAI ERIANAEAAA
SGAPRPPEIT TLEHYPLGVN DADASGRVAD AFRQYFSADR VRQVDAASAS EDFGLFGTEW
GVPSVFWFVG GTDPDLYAKA KAAGEINKIP TNHSPYFAPV MHPTLETGVE TMVIGALAWL
QHESQHQELR P