Gene Daro_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1248 
Symbol 
ID3569355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1354594 
End bp1355640 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content59% 
IMG OID637679714 
ProductN-acetylneuraminate synthase 
Protein accessionYP_284473 
Protein GI71906886 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2089] Sialic acid synthase 
TIGRFAM ID[TIGR03586] pseudaminic acid synthase 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value0.862744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTT CAATTGATGG CCGTCGCATT GGCCACGATG CTCCGCCGTT CATCATTGCC 
GAGCTGTCGG CCAACCACAA CGGCTCGCTG GAGCGTGCGC TTCAAACCAT TGATGCAGCC
AAAGCCTGTG GGGCGGATGC AATCAAATTG CAGACCTATA CGGCCGACAC GATGACAATC
GACTGCGATC AGCCTGAGTT CATGATCCGC GGCGGCCTGT GGGATGGATA CAAGCTGTAC
GACCTCTATC AGTGGGCACA GACACCTTTC GACTGGCACA AGGCCATGTT CGAACATGCG
CGAAAGATAG GCATCACGGT CTTTTCGACA CCCTTCGATG AGAGCGCTGT CGACTTGCTG
GAAGCCCTCG ATACGCCAGC TTACAAAATC GCCTCGTTCG AACTGACAGA CCTACCCCTG
ATCCGCTACG TGGCCGCGAC CGGCAAGCCG ATGATCATGT CGACCGGAAT GGCCAGCGAG
GCTGAAATCG AGGAAGCAGT GAGCGCGGCC CGTGAAGCGG GTTGCACCGA CCTTGTCCTG
CTCCATTGCA TCAGCAGCTA TCCCGCACCG ATGGATCAGG CCAAACTGCG ACAGATCGCG
GGCCTTGAAA GCCGCTTCGG CGTCACGCCG GGCCTGTCCG ATCACACGCT TGGCACGGTA
GCCTCGGTGG CTGGTGTAGC CCTCGGCGCT TGCGTAATCG AAAAACATTT CACCCTGAGC
CGCGCGGACA AGGGGCCGGA CAGCGAGTTC TCCCTTGAAC CGGACGAATT GCGCCGGCTG
TGCCAGGATG CCCGCGACGC CTGGTCGGCA CTTGGAAGCC TCGGGTTTGA ACGGCAGCAA
GCTGAGGAGG CGAGCAAGGT CTTCCGGCGG TCGGTGTATT TCGTGCGCGA TGTGAGCGCT
GGTACCGTGA TAGGAGCGGA ACACATCCGT CGCATACGCC CGGGGATGGG GCTTGAACCA
AAATACTTTG ATCAGTTGAT CGGCAGGCGC GTGAATCAGG ATGTCTCGCG CGGCACGCCA
GTGAAATGGA CGCACTTCGA TGAATAG
 
Protein sequence
MSFSIDGRRI GHDAPPFIIA ELSANHNGSL ERALQTIDAA KACGADAIKL QTYTADTMTI 
DCDQPEFMIR GGLWDGYKLY DLYQWAQTPF DWHKAMFEHA RKIGITVFST PFDESAVDLL
EALDTPAYKI ASFELTDLPL IRYVAATGKP MIMSTGMASE AEIEEAVSAA REAGCTDLVL
LHCISSYPAP MDQAKLRQIA GLESRFGVTP GLSDHTLGTV ASVAGVALGA CVIEKHFTLS
RADKGPDSEF SLEPDELRRL CQDARDAWSA LGSLGFERQQ AEEASKVFRR SVYFVRDVSA
GTVIGAEHIR RIRPGMGLEP KYFDQLIGRR VNQDVSRGTP VKWTHFDE