Gene Nmul_A0554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0554 
Symbol 
ID3784737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp640663 
End bp642036 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content58% 
IMG OID637810636 
Productpeptidase M23B 
Protein accessionYP_411254 
Protein GI82701688 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0864407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGGCA ACAAGCAGAC ATGGGCGCGC AAAAAAAACA GATTACATTG GCTCCTTTAT 
GAGGGACTTG AGATAGTGAG TAACAGCCAG GCTTTTACAC TTACAAGAAA AACCCTGCGC
GGCCTGATAC TGTTATCGAG CATTCCGTTG TTTGGAATGG TGGCAGCCTT CGGTATTGCC
CCCGATACTG CGGTGGAAGA CGTACCGGTC GAGCAGGTTG TCCTCGGTCT GGAGATTCCG
GAGATTCGCT CGAGGCCGGC GGAGGGGATG ACTTTTTGGC GTCATGAACG TATCCAGCAG
GGCGATACAA TCGGGAGCCT GCTTTCCCGG CTTGAAGTGA ATAATCAGGA CGTGGCACGC
CTCATTCGGG ATACCTCCGA GCTGAAGGCC TTGCATCCGC TGGCCGCGGG CAGAATGGTG
CATGCCGAAA CCAGCGCCGC GGGCGAATTG CTGCTGCTGC GCTACTTCCC CGGCGGCAGC
GATCAGGTGG TGCTGGAAAA ACGCGACGGC AGCTATGTGG TGAGCGACAG GCCGGCATTG
CTGGAAACCC ATATCCAGAT GAAATCAGGC GTGATCGAAA GCTCGCTTTT TGCCGCGATC
GACCGCGCGG GGATTCCGGA CAGCATAGCT TCCCAGATCG TCGATATCCT GTCTTCCCAA
ATAGATTTCC ACCGTGATCT GCGCAAGGGT GACCGTTTTA CAGTGGTGTA CGATTCCCTC
TACGGCAACG GGGAACCGAC GAGAGCCGGC CGGGTGCTGG CGGTGGAGTT CGTCAACCAG
GGAGTGCCTT ACCGGGGAGT ATATTTCCCC GGAAGCGACG GTGGAGAAGG CGGCTATTAC
ACGCCGGACG GCAAGAACCT GCGCAGGGTA TTTCTGCGCT CGCCGCTGGA ATTTTCCCGC
ATCAGTTCCG GCTTTTCCAG CGGCCGCTTC CATCCCATCC TGAAAAAATG GCGGGCCCAT
AAGGGCATCG ACTACGTGGC GCCCACCGGC ACGGGGGTAA AGGCGGTTGC CGATGGCGTC
GTGGCGGTAG CGGGATGGGA AGCGGGATAT GGAAATTTCA TCATCCTCGA GCATGAAGGA
TCGTATGCCA CGGTCTACGG CCACCTGTCG GCTTTCGCCA AAGGGTTGCG CAAGGGTCAG
CGTGTCCGTC AGGGATATGT CATTGGCCGG GTGGGAGCCA CCGGCTTGGC GAGCGGGCCT
CATCTGCACT TTGAGTTCCG TGTCAACGGC ATTCAACGCG ATCCTCTGAA GGAGCCGATG
CCGGAAGGAA AACCGATCGC TCCCGCGCAC CTCGCGGCAT TTTACGAATC CACGAAATCA
TCGATGGCGA GGCTCGATAT GCTGCACGGC ACCAATCTCG CATTGCTGGA TTAA
 
Protein sequence
MPGNKQTWAR KKNRLHWLLY EGLEIVSNSQ AFTLTRKTLR GLILLSSIPL FGMVAAFGIA 
PDTAVEDVPV EQVVLGLEIP EIRSRPAEGM TFWRHERIQQ GDTIGSLLSR LEVNNQDVAR
LIRDTSELKA LHPLAAGRMV HAETSAAGEL LLLRYFPGGS DQVVLEKRDG SYVVSDRPAL
LETHIQMKSG VIESSLFAAI DRAGIPDSIA SQIVDILSSQ IDFHRDLRKG DRFTVVYDSL
YGNGEPTRAG RVLAVEFVNQ GVPYRGVYFP GSDGGEGGYY TPDGKNLRRV FLRSPLEFSR
ISSGFSSGRF HPILKKWRAH KGIDYVAPTG TGVKAVADGV VAVAGWEAGY GNFIILEHEG
SYATVYGHLS AFAKGLRKGQ RVRQGYVIGR VGATGLASGP HLHFEFRVNG IQRDPLKEPM
PEGKPIAPAH LAAFYESTKS SMARLDMLHG TNLALLD