Gene Nmul_A1558 details

Gene Information

Locus tag: Nmul_A1558
Symbol:
ID: 3785280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1788374
End bp: 1789318
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 55%
IMG OID: 637811646
Product: histone deacetylase superfamily protein
Protein accession: YP_412253
Protein GI: 82702687
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
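
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon while Protein Length does not; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1788374, 1789318)
print(length)                     # 945
print(protein_length_aa(length))  # 314
```

Both results match the Gene Length (945 bp) and Protein Length (314 aa) fields in the record.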


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.643913
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACCCG CTTCCCATAC CGCTTTCATT TCCCATCCGG ATTGCCTGTT GCACGAAATG 
GATTTTTATC ATCCTGAGAG TCCAGCGCGG CTGAAGGCGA TCGAGGACGA GCTGTCTGCT
TCAGGACTGA TGGATAAACT TAGACGCTAT CAGGCGCCAT TAGCGACTGT CGGCCAACTG
GAGCGTGTGC ATACACGGGA GCATATTGCA AGGTTGCACG CTGCAGCATC CCGCGCGGCT
TCGGGAGGTT TCGTCTACCT TGATCCGGAT ACCGCCATGA ACCGCCACAG CCTTGGAGCA
GCTTATCGGG CCGCGGGGGC TGTTGTCCTC GCTGCCGATC TCGTGATAGA AGGAGCGGCG
GAAAATGCAT TTTGCAGTAT TCGTCCCCCG GGTCACCACG CGGAACGCGG ATACCCGATG
GGTTTCTGCC TGTTCAACAA TATTGCCGTA GCGGTTGCTC ACGCGCTTGA AACACATGCT
CTGAAACGTG TCGCGGTGGT GGACTTCGAC GTGCATCACG GCAACGGTAC GGAAGATATC
TTTCAGCACG ATCCCCGCGT CATGATGGTC TCGACATTTC AGCACCCGTT CTATCCATAT
AGCGGCATCG CAGGCCGTTC AGAGCGAATG GTCAACATCC CGCTGCCAGC GGGGAGCAAC
GGCAAGGTAT TTCGCAAAGC AGTGGATGAA TTCTGGTTGC CGGCGCTGGA AAGGTTTAAA
CCGCAAATGT TGTTTGTTTC TGCTGGTTTC GATGCTCATG CCGATGATGA GCTTGCTTCT
CTGAATCTGG TGGAAGACGA TTACGCGTGG GTAACTGAAA AAATCAAAGA GGTTGCCCGC
GCTTATGCCG GGAAACGTAT CGTATCGGTG CTGGAAGGCG GGTATGCGTT GGCTGCGCTG
GCACGAAGCG TGGCAGCGCA TATAGAAGTC CTTATGAAGC CCTGA
 
Protein sequence
MRPASHTAFI SHPDCLLHEM DFYHPESPAR LKAIEDELSA SGLMDKLRRY QAPLATVGQL 
ERVHTREHIA RLHAAASRAA SGGFVYLDPD TAMNRHSLGA AYRAAGAVVL AADLVIEGAA
ENAFCSIRPP GHHAERGYPM GFCLFNNIAV AVAHALETHA LKRVAVVDFD VHHGNGTEDI
FQHDPRVMMV STFQHPFYPY SGIAGRSERM VNIPLPAGSN GKVFRKAVDE FWLPALERFK
PQMLFVSAGF DAHADDELAS LNLVEDDYAW VTEKIKEVAR AYAGKRIVSV LEGGYALAAL
ARSVAAHIEV LMKP
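
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It is not the database's own method. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGCGACCCG CTTCCCATAC CGCTTTCATT TCCCATCCGG ATTGCCTGTT GCACGAAATG"
print(translate(first_line))  # MRPASHTAFISHPDCLLHEM
```

The output matches the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field, although the rounding used by the database is not stated.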