Gene Nmul_A1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1991 
Symbol 
ID3785015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2287557 
End bp2288771 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content55% 
IMG OID637812080 
ProductSerine-type D-Ala-D-Ala carboxypeptidase 
Protein accessionYP_412678 
Protein GI82703112 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACTGATT CAGAAGCCCG GCGGATAACG TCTTCGATAT GGCTGGGATA TAATAACGCT 
TTTTTGCAGA TCGCCATGAG ACGCTTGCTT CCTATATCGC TGTGTCTGCT TGCCCTGCCA
CTGGCTGCTC AACAACCCCA GTTTCACGCG CCGCCGCAAA CTCTCTCCGT CGCCGCCAAG
TCTTATATAC TTGCCGATCT TCAGAGCGGA CAGGTGCTTG TGAGCAAGAA TGCCCATGAA
CGCGTCGATC CGGCATCGCT GACAAAGCTG ATGACGGCTT ATGTGGTTTT TGCAGCTCTG
TATCAGAAAC GTGTCACCTT GACGCAGGCT GTGCCAGTCT CGACACGTGC CTGGCGGGCG
CAAGGCTCCC GCATGTTCAT CGAGCCGAAG AAGCCCGTGA CGGTCGATGA ATTGATGCGC
GGCATGATCG TGCAGTCAGG AAACGACGCC TCCATTGCAC TGGCGGAGGC CGTTTCAGGA
TCAGAGGAGG CATTCGCTCA AGCGATGAAC AAGGAGGCGG CGCGCATGGG CATGAAGAAC
ACCCGTTTTG CCAATTCAAC CGGACTTCCG GACCCGGACC ATTACACCAC CGCGTACGAT
CTCGCCTTGC TCGCAACCGC CATCATTCGC GATTTTCCGG AATATTATCC ACTCTATTCC
CTCAAGGAAT ATACCTATAA CAAAATTACT CAGGCGAACC GGAACCGCCT GCTCTGGCTC
GACCCGAATG TCGACGGGAT GAAGACGGGA CACACTGACG CAGCCGGTTA CTGCCTCATT
ACTTCAGCCA GGCGGGGACA GCGTCGGTTG GTTGCGGTAG TGATGGGAAC CGCCTCGGAG
AGCGCGCGCG CGATGGAAAG CCAGCGCCTG CTGAATTATG GTTTTCAGTT CTACGATACG
GTTCGTCTCT ATCCGGGAGA GCAGGAGGTG GTTGCCATTC CATTATGGAA AGGCAACCAG
GACAAACTCA GGACGGGATT CGGAAATGAT GTTTATTTTT CGCTTCCCCG CCATCAGACT
GACAAACTCA AGGCCAGGAT GGAATATAAG CAGCCGCTTC TCGCTCCTGT CGCTGCGGGC
CAAAAAGTTG GTACAGTGAA GTTTATGCTC GAAGGGAAGC AGGTGGTTGA ACATCCGCTG
GTAGCGCTCG AAACCGTCAG CGCTGCGAAT ATTTTTGGCC GGGCATGGGA TAGCATGCGG
CTTCTATTTA ACTAG
 
Protein sequence
MTDSEARRIT SSIWLGYNNA FLQIAMRRLL PISLCLLALP LAAQQPQFHA PPQTLSVAAK 
SYILADLQSG QVLVSKNAHE RVDPASLTKL MTAYVVFAAL YQKRVTLTQA VPVSTRAWRA
QGSRMFIEPK KPVTVDELMR GMIVQSGNDA SIALAEAVSG SEEAFAQAMN KEAARMGMKN
TRFANSTGLP DPDHYTTAYD LALLATAIIR DFPEYYPLYS LKEYTYNKIT QANRNRLLWL
DPNVDGMKTG HTDAAGYCLI TSARRGQRRL VAVVMGTASE SARAMESQRL LNYGFQFYDT
VRLYPGEQEV VAIPLWKGNQ DKLRTGFGND VYFSLPRHQT DKLKARMEYK QPLLAPVAAG
QKVGTVKFML EGKQVVEHPL VALETVSAAN IFGRAWDSMR LLFN