Gene Nmul_A2571 details


Gene Information

Locus tag: Nmul_A2571
Symbol:
ID: 3784651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2943760
End bp: 2945085
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 57%
IMG OID: 637812662
Product: glycerophosphoryl diester phosphodiesterase
Protein accession: YP_413252
Protein GI: 82703686
COG category: [C] Energy production and conversion
COG ID: [COG0584] Glycerophosphoryl diester phosphodiesterase
TIGRFAM ID:
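
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields in this record.
start_bp = 2943760
end_bp = 2945085
gene_length_bp = 1326
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1326 442 441
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.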


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCGCC AAACATCCTT CCATCCCGTA CTGATAATCT TATTTTTCTT CCTTGTCTTA 
CTGGTTATTG CGCCCGCCCT GGCCAGCGGA AGAAACGACG GCATGGAAGG AACTTCCTTC
CATGATGCTT CTCCTGACAA TTCCAGCGGC AATGTTCAAC TCGGCCCTCG TCCTTTTTTC
CTTGTGGAGG ACATGGAGGG CGGTGAACTC AAGAGCAAGC TCGCACGTTG CGGTGCCGGC
CCCTTCAGGA AAACATACTT CTCCATCGGT CATCGCGGCG CGCCACTGCA GTTTCCCGAG
CACACGGCGG AGTCCTACCG GGCGGCCGCC CGCATGGGGG CCGGCATTCT CGAGTGCGAC
GTAACATTCA CCCGGGACAA GGAACTGGTA TGCCGCCATT CTCAGTGTGA TTTGCACACC
ACCACGAATA TTCTGGAAAC ACCGCTTGCC GAAAAATGCA CGCGCCCTTT TACACCCGCC
CAGTTCGATG CTTCGGGAAA CCTGATCCAG GAGGCTTCCG CCCGTTGCTG CACGAGCGAC
ATCACTCTGG ATGAATTCAA GAGCCTGAAA GGCAAGATGG ACGCCTTCAA CCCCGGGGCG
AGGAATGTGG CGGAATACGT GGGCGGCACA CCTGCATGGC GCACTGACTT GTATGCCGGC
CCTACCAGTG GAACCCTGCT CAGCCACCGG GAGAGTATCG AACTCTTCAG CAAGCTGGGG
GCAAAAATGA CACCCGAACT CAAGAGCGCG GAGGTGGCAA TGCCTTATGA CAGCGATGGC
GACGGAGTCG GCGACTACAC GCAGGAACAT TATGCTCAAC AGATGATCGA CGAGTACAAG
GCTGCCGATG TCAAACCTCG CGATGTCTTT CCGCAGTCCT TCGATATCCG CGATATCCGT
TACTGGATCG CCAGGGAGCC CGAGTTCGGG AGGCAGGCGG TTTATCTCGA CGACGCCAAT
ACGGTCGCTG ATCTTCCCAA TGCCAGTCAG TTGACCGCTT ACAAAGCCGA GGGCATCAAT
ATTGTCGCCC CTCCCATATT CGCGCTGCTG GATGTCGATG GGGGCGGCAA TATCATCCCG
TCCAGCTACG CCCTGCAGGC CAGGGCCGCA GGACTAGGCC TCATCACCTG GACGCTGGAG
CGCTCCGGAA TACTAGCTGA CGGCGATAAC GGCTTTTATT ACCAGACTTT CGATTCGGCG
ATAAGGCGCG AAGGCGATGT GATGAAAGTG CTGGATGTTT TGAACAGGGA AGTGGGCGTC
CTCGGTATCT TCAGCGATTG GCCGGCCACC GTAAGCTATT ACGCTAATTG CATGAAGCTG
AAATAG
 
Protein sequence
MIRQTSFHPV LIILFFFLVL LVIAPALASG RNDGMEGTSF HDASPDNSSG NVQLGPRPFF 
LVEDMEGGEL KSKLARCGAG PFRKTYFSIG HRGAPLQFPE HTAESYRAAA RMGAGILECD
VTFTRDKELV CRHSQCDLHT TTNILETPLA EKCTRPFTPA QFDASGNLIQ EASARCCTSD
ITLDEFKSLK GKMDAFNPGA RNVAEYVGGT PAWRTDLYAG PTSGTLLSHR ESIELFSKLG
AKMTPELKSA EVAMPYDSDG DGVGDYTQEH YAQQMIDEYK AADVKPRDVF PQSFDIRDIR
YWIAREPEFG RQAVYLDDAN TVADLPNASQ LTAYKAEGIN IVAPPIFALL DVDGGGNIIP
SSYALQARAA GLGLITWTLE RSGILADGDN GFYYQTFDSA IRREGDVMKV LDVLNREVGV
LGIFSDWPAT VSYYANCMKL K
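
The sequences above are printed in space-separated blocks of ten characters, so they usually need to be joined before further use. The following is a minimal sketch of how that might be done, along with a GC-fraction helper. The function names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool, and the sample input is only the first and last lines of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listing, used only to exercise
# the helpers. This is NOT the complete gene.
sample = """
ATGATCCGCC AAACATCCTT CCATCCCGTA CTGATAATCT TATTTTTCTT CCTTGTCTTA
AAATAG
"""

seq = join_blocks(sample)
print(len(seq))           # 66 characters once the whitespace is removed
print(seq[:3], seq[-3:])  # ATG TAG (start and stop codons of this CDS)
```

Pasting the complete gene sequence into `sample` and calling `gc_fraction(seq)` would give a value that can be compared with the GC content field of the record.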