Gene Nmul_A2737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2737 
Symbol 
ID3785708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3142054 
End bp3143451 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content57% 
IMG OID637812828 
Productpeptidase M16-like 
Protein accessionYP_413416 
Protein GI82703850 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTATC CGTGCTTTCG CTTTTTGCCG GCTTGCTTAT GGGCGGCTCT CGTAATCCCC 
GTCTCGCTGT TCTCCTCCGC CGCTCTTGCC AGCACGCATG AATTCACTCT GGGTAATGGC
TTGCGGCTCA TCGTCAAGGA AGATCATCGT TCCCCGGTCG TCATCTCGCA AATCTGGTAC
AAGGCCGGGA GCATCGACGA GGTAAACGGC AGGACTGGCG TCGCCCACGT GCTTGAACAC
ATGATGTTCA AGGGCACGAA GAAAGTACCG GGCGGCGAGT TTTCCCGCCT CATCGCAGCG
GCGGGGGGCC GTGAAAACGC TTTCACGGCG CAGGATTATA CGGCCTATTT TCAGCAATTG
CATAAGTCCC GGCTGCCGCT GGCGATGGAA CTGGAATCGG ACAGGATGCG CAATCTGGTG
CTGACAGAAG AGGAGTTTTC CAAGGAAATC AAAGTCGTGA TGGAAGAACG GCGGTTACGT
ACGGACGATC AGGCGCGCTC TCTGGTCCAT GAGACACTTA TGGCGACCTC CTATCAGTCG
CACCCTTACC GGCATCCGGT GATAGGCTGG ATGAACGATC TGCAGAACAT GACTGTGGGG
GACGCCCGGC AATGGTATGA ACGCTGGTAT GCGCCCAACA ATGCTGTGCT TGTGGTGGTC
GGGGATGTAG ACCCCCGGCA GACCTTCAAT CTGGCGCGAA AATACTACGG ACAGATAAAG
GCGAAGCCCG TGCTGTCCCT GGACCAACGC AAGCCACAGA TCGAGCCGAA GCAGCTCGGC
GTCAAGCGGC TGACAGTAAA AGCGCCCGCG CAATTGCCCT ACGTTGCAAT GGCTTATCAT
GCTATATCGC TGAGCAAGCC GGAAACGGAC TGGGAGCCGT ATGCACTCGA GATGCTGGCG
GGCGTGCTCG ATGGGAATGA ATCGGCGCGG CTCAATAAAG CCCTGGTGCG TGAGCAGCGC
ATCGCCAGCA CGGCCGGAGC CAGCTATGAT TCCACCGCGC GAGGTCCCGC GGTGTTTTAT
CTCGATGGCA CACCGAGCGA AGGCAAAACA GTGGGTGAGG TGGAAGCGGC GTTGCGTGCC
GAGATCGAAA AGCTCGTACG TGACGGCGTG ACAGAGGAGG AGCTTGCGCG CGTCAAGGCA
CAGGTGGTGG CGGGACACGT TTTTCAGCTC GATTCCATGT TTTTCCAGGC AATGCAGATC
GGCCAGCTCG AAAGTATAGG ATTGTCCTAT CGCGATGTGG ACACCATATT GCGTAAATTG
CAGGCTGTTA CCGCCGAACA GGTGCGCGAG GTGGCAAAAA AATACTTGAA GGACGATAAT
CTGACGATTG CAGTACTCGA TCCGCAGCCC CTGGAACAGA AAGCACCCGC GGCAGTACCG
GCGGGTTTGC GGCATTAA
 
Protein sequence
MTYPCFRFLP ACLWAALVIP VSLFSSAALA STHEFTLGNG LRLIVKEDHR SPVVISQIWY 
KAGSIDEVNG RTGVAHVLEH MMFKGTKKVP GGEFSRLIAA AGGRENAFTA QDYTAYFQQL
HKSRLPLAME LESDRMRNLV LTEEEFSKEI KVVMEERRLR TDDQARSLVH ETLMATSYQS
HPYRHPVIGW MNDLQNMTVG DARQWYERWY APNNAVLVVV GDVDPRQTFN LARKYYGQIK
AKPVLSLDQR KPQIEPKQLG VKRLTVKAPA QLPYVAMAYH AISLSKPETD WEPYALEMLA
GVLDGNESAR LNKALVREQR IASTAGASYD STARGPAVFY LDGTPSEGKT VGEVEAALRA
EIEKLVRDGV TEEELARVKA QVVAGHVFQL DSMFFQAMQI GQLESIGLSY RDVDTILRKL
QAVTAEQVRE VAKKYLKDDN LTIAVLDPQP LEQKAPAAVP AGLRH