Gene Nmul_A1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1761 
Symbol 
ID3783961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2011558 
End bp2012607 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content57% 
IMG OID637811847 
Productbeta-hexosaminidase 
Protein accessionYP_412450 
Protein GI82702884 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTCG GTCCCGTCAT ACTCGATATC GAAGGCACGC AACTCACCGC CAACGATAAG 
AAAAAACTCC GGCACCCGCT GGTTGGAGGG GTGATCCTGT TCACGCGCAA CTATTCCTCG
CTTGCGCAAC TCATGCATCT GACTGCTGAA ATCCATGCGC TGCGAACGCC GCCACTCCTG
GTCGCCGTCG ATCACGAAGG AGGCAGAGTT CAACGTTTCC GGGAAGATTT CACGCGCCTG
CCCCCCATGC GAGAACTGGG CAGGATCTGG GATGAGCATC CTGCCCAGGC GCGGCATCTG
GCGCATGAGG CGGGATATGT CCTGGCGGCG GAACTACGGG CTGCGGGCGT GGACTTCAGT
TTTACACCGG TCCTGGATAT GGATTATGGC CAAAGCAGCG TCATCCGCGA CCGTGCTTTT
CACCGTGACC CGCAAGCCAT TGCCGAGCTG GCCCATAGCC TGATGAGCGG GTTGAAATCA
GCCGGAATGG CAGCGGTGGG CAAACATTTC CCTGGTCATG GTTATATCGA GGCCGATTCC
CATTTCGAAA TGCCGGTGGA CGAACGAACT TACGCGCAGA TCGAAATGGA CGATCTTATT
CCATTCCGTA AAATGATCGG TTTCGGCCTT ACCGGCATGA TGCCTGCCCA CGTCATTTAT
CCAAAGGTGG ATGCATTACC GGCCGGTTTT TCCGAAGTAT GGCTCAAAAA GGTTTTGCGG
GGTGAGCTGG GTTTCGAAGG GTGTATCTTC AGCGACGATC TGAATATGGC GGGAGCAGCT
TTTGCAGGCA ATCCGGTGGA GCGGGCCCAG AAAGCATTGC ATGCGGGATG CGACATGGTG
CTTCTGTGTA ATAACCCGGA AGCGGCCGAA ATGCTGCTCG CGGAGCTACA TTGGGACCTG
CCCGCCCTTG GGGTGATTCG TCTCGCCCGC ATGCGCGGGC GCCCAAACCC GGATTCGCTG
GTGAAACTGC ACGAAAACCC GAACTTCGTC AGTGCCGTGG AAAAAATTGC GGGTATCGGC
GTTCGCAGCG GCGAGTTGCC GCTGGTGTAG
 
Protein sequence
MSLGPVILDI EGTQLTANDK KKLRHPLVGG VILFTRNYSS LAQLMHLTAE IHALRTPPLL 
VAVDHEGGRV QRFREDFTRL PPMRELGRIW DEHPAQARHL AHEAGYVLAA ELRAAGVDFS
FTPVLDMDYG QSSVIRDRAF HRDPQAIAEL AHSLMSGLKS AGMAAVGKHF PGHGYIEADS
HFEMPVDERT YAQIEMDDLI PFRKMIGFGL TGMMPAHVIY PKVDALPAGF SEVWLKKVLR
GELGFEGCIF SDDLNMAGAA FAGNPVERAQ KALHAGCDMV LLCNNPEAAE MLLAELHWDL
PALGVIRLAR MRGRPNPDSL VKLHENPNFV SAVEKIAGIG VRSGELPLV