Gene Nmul_A2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2267 
Symbol 
ID3785429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2577515 
End bp2579680 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content57% 
IMG OID637812355 
Productsucrose-phosphate phosphatase 
Protein accessionYP_412951 
Protein GI82703385 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR01482] Sucrose-phosphate phosphatase subfamily
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB
[TIGR01485] sucrose-6F-phosphate phosphohydrolase
[TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial
[TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0311712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGT TATACGTACT GATGCTGAGC CTGCACGGCC TGATTCGGGG AAACGACATG 
GAGTTGGGGT GCGATGCGGA TACGGGGGGC CAGGTGCTGT ATGTAGTGGA GTTGGCGCGA
GCCCTGGCGC GTCAGCCGCA GGTGGGCAAA GTGGATCTGC TCACGCGAAG GATAGAAGAT
CCTTCGGTTT CCCCGGATTA TGCCCGTCCT GAGGAAACGC TGGGCAACAA TGCGCGCATT
ATCCGCCTTC AGTGCGGTCC CCGGCGCTAT CTGCGCAAGG AAAGCCTTTG GCCCTACCTG
GATCAACTGG TGGATCGCGC GCTGCTTTTT CTCCGCGGGC AGAAGCGCTT GCCGGATGTT
ATCCACAGTC ATTATGCCGA CGCCGGTTAT GTCGGCATGC AGCTTTCCCA ACTATTGGGC
ATCCCTCAGA TTCATACCGG ACACTCATTG GGACGCAGCA AGCAACAACG CCTGCTGGCA
CAAGGCCGGA AGCCACAGGC ACTGGAGCGC CAGTTCAGCT TTTACAGGCG AATAGCTACC
GAGGAAGCAG TATTGCAACA TGCTAGCCTG ATCATAACCA GCACGCCTCA GGAAAGCGTC
GAGCAATACG GGTTATACAC CAATTATCAC CCCGAACGCG CGGTTGTCAT TCCGCCGGGG
ACCGATATAT CGCGTTTCTC TCCTCCCAAT CGTCAAAAAC CAGTGGAAGT GGAGACTGCG
GGTCTCATCG ATCGCTTTCT CGCGCACCCG CGCAAGCCCC TGATATTGAC CATTTGCCGT
CCTGAAATTC GAAAAAATCT GGGGGCGCTG GTCGCTGCCT TCGGCAGTTC GCCTAAACTC
CATGAACAGG CCAATCTCGC CATCGTGGCA GGCAACCGGG ATGACATCCG TCAACTGGAT
GCTGCCCAGA ACGAAGTGAT GACCGGCCTG CTGCTGGATA TCGACCGCTA CGATTTATGG
GGCAAGGTGG CCCTGCCCAA ACACCACAAG CCTTCCGACA TCGCCGGTTT CTACCGGCTC
GCAGCTCAGC GCCGTGGCGT GTTTATCAAT CCGGCGCTTA CCGAACCCTT CGGCCTTACA
CTGATCGAAG CTGCCGCCAG CGGGCTTCCC ATTGTCGCGA CAGAAGATGG AGGGCCTCGC
GATATCGTGG CGAACTGCAA AAATGGGCTA CTGGTAAATC CATCGGACAT CGGCGCGATT
GCCGGAGCGA TCGAGTATGC GCTTGCTGAT CCCGTGCGCT GGCGACGCTG GGCGCGAAAT
GGCGTTTCAG GGGTAAAAAA CCACTACACG TGGGACGCGC ATGTCAGGAA ATATCTGCAT
GTTCTGTCGC GTCTGCTGCA TCACGAGCGT AAGCGCATCC GCCGGAACCT GGCGATATAT
CAAAGGCAAC CGCGCCCGTT GCCCTTGATA TCGCATATGC TCATCACGGA TATAGACAAT
ACGTTGCTTG GCGACCGTGC TGCCCTGCGT CGCCTTCTTG CTATTTTGCG CGCGACCCCC
CCCAACCTGG GCTTTGGTGT CGCTACTGGC CGCACGCTGG AAAGCGCCGT GAAAATATTG
AAGGAATGGG GCGTACCCTT GCCCGATGTG CTGATCACCG CAGTGGGCAG CGAAATCTAT
TATGGTCCCG AGCTTCGTCC GGATACCGGG TGGCAGAACC TGATCAAGTA TCTATGGCGC
CGCGATGCCA TTGAAAACGT ATTGCGGGGG GTGCCGGGAC TGACGCTGCA GGCAGCCGAA
AACCAGCGCG AATTCAAGCT CAGCTACAAC GTGGATCCGG AAAAAATGCC GCCAATAGCC
AAAATCCGCA CATTGTTGCG CGAGCAGAAC CTGTCGGCAC ACCTGATTTA CTCGCGCCGG
ACTTATCTCG ACGTGTTGCC ACTGAGAGCG TCCAAAGGGC GGGCAATACG TTATCTTGCC
TACAAGTGGG GGCTGCCGCT ACGTGCGTTT CTGGTCGCAG GAGATTCCGG CAACGATCAT
GAAATGCTGA TCGGTGATAC CCTCGGCGTC ATAGTCGCCA ACCATAGTCC CGAACTCGCA
AGCCTGAAAG GAAATGAGCA AATCTATTTT GCCCGGTCCG CATATGCCGA TGGCATTGCA
GAGGGCATGG CACATTATGA ATTTGGTACT TCCATCATGG AGACTGCAAA TGCTGCACAA
ATTTGA
 
Protein sequence
MKELYVLMLS LHGLIRGNDM ELGCDADTGG QVLYVVELAR ALARQPQVGK VDLLTRRIED 
PSVSPDYARP EETLGNNARI IRLQCGPRRY LRKESLWPYL DQLVDRALLF LRGQKRLPDV
IHSHYADAGY VGMQLSQLLG IPQIHTGHSL GRSKQQRLLA QGRKPQALER QFSFYRRIAT
EEAVLQHASL IITSTPQESV EQYGLYTNYH PERAVVIPPG TDISRFSPPN RQKPVEVETA
GLIDRFLAHP RKPLILTICR PEIRKNLGAL VAAFGSSPKL HEQANLAIVA GNRDDIRQLD
AAQNEVMTGL LLDIDRYDLW GKVALPKHHK PSDIAGFYRL AAQRRGVFIN PALTEPFGLT
LIEAAASGLP IVATEDGGPR DIVANCKNGL LVNPSDIGAI AGAIEYALAD PVRWRRWARN
GVSGVKNHYT WDAHVRKYLH VLSRLLHHER KRIRRNLAIY QRQPRPLPLI SHMLITDIDN
TLLGDRAALR RLLAILRATP PNLGFGVATG RTLESAVKIL KEWGVPLPDV LITAVGSEIY
YGPELRPDTG WQNLIKYLWR RDAIENVLRG VPGLTLQAAE NQREFKLSYN VDPEKMPPIA
KIRTLLREQN LSAHLIYSRR TYLDVLPLRA SKGRAIRYLA YKWGLPLRAF LVAGDSGNDH
EMLIGDTLGV IVANHSPELA SLKGNEQIYF ARSAYADGIA EGMAHYEFGT SIMETANAAQ
I