Gene Daro_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1247 
Symbol 
ID3569354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1353341 
End bp1354597 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content57% 
IMG OID637679713 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_284472 
Protein GI71906885 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0223] Methionyl-tRNA formyltransferase
[COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins 
TIGRFAM ID[TIGR03585] pseudaminic acid biosynthesis N-acetyl transferase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.498575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTAC GCCCACTCGC CGAACGCGAT CTCGCAATGG TTCGTAACTG GCGCAACCAT 
CCCTTCGTAC GTCTGAGCAT GTTCTCAACC CAATTGATTG AGGAGAGTGA ACATCTCGCA
TGGTTCGAGC GTGTGAGCGT GAATCCGGAG GTCTGCTGGT TAGTGCATGA GGACGACACG
GGAAAGCCCG ATGGCGCAGT GTACTTTACC GAATATCGGC CCCACCAAGG TATGGCCTTT
TGGGGCTTTT ATCGCGATCC CGAGGCTACG GGCGGATCAA GCACCCGGCT TGGCATGGAT
GGTCTGGACT ACGCATTCGA TCAGCTCAAG TTGCGCAAAT TGAATGCCGA CGTGCTTGCC
AATAACCAGC GCAGCATCGC TTTTCATGAA CGGCTAGGCT TTCAAAGGGA AGGCGTATTC
CGGGATGGCC ACCTTGCTGA TTCCGGGCCG GTTGACGTCA TACGCTATGG CATCCTTGAA
AACGAATGGC GCGAGCAGCG TCCGTGGGTG CTTGCCTGCC TGGAGGCTAG AGCCCACCGG
ACCTTGCCGG AGCGCTCTGA CCTCCAGCAA TACATTGTCG CTAGTTGCAA GGCTTGGCAC
CGCCCCGGCT TTGAGGCACT CCAACGCGAA ACCCCTGGGG ACTGGACCTG GGTCTCCTGT
CCGTCCGAAC TGATGGCTGC ACTGGAACAC CAATCCCCGA GCTACATTTT TTTCCTGCAC
TGGAGCTGGT TGGTGCCGAA AGATGTGTGG TCCAGATATG AGTGCGTCTG CTTTCACATG
ACAGATGTTC CTTACGGCCG AGGCGGCAGT CCCTTGCAGA ATCTGATCGC TGCCGGGCAC
ACTGAAACAA AGCTCAGCGC CTTGCGCATG CTTGCGCAGA TGGATGCCGG CCCGGTCTAC
GCCAAACGGC CATTGCACCT CAACGGGCGA GCCGAAGATA TTTACCTCAG AGCCGGAGCG
CTCAGCTTTG AGCTGATCTC TTGGATCGTG GACCAGAAGC CCGAACCGCT CGAGCAACAA
GGTGGCCCCC TTACTTTCAA GCGGCGAACC CCTGATCAAA GCGTGTTGCC AAGCCAAGGT
GCGCTGGACA AACTCTATGA CCATATCCGC ATGCTGGATG CACCAGGGTA TCCTCTGGCC
TTCATTGAGC ATGGGGCATT CCGCATCTGC TTCTCCAACG CCGAGCTGAA AAACGGGATA
CTGGAAGCCC GCGCACAAAT CAGCAAATGC CAATCTACAA AAGGAGCCGA CACATGA
 
Protein sequence
MPLRPLAERD LAMVRNWRNH PFVRLSMFST QLIEESEHLA WFERVSVNPE VCWLVHEDDT 
GKPDGAVYFT EYRPHQGMAF WGFYRDPEAT GGSSTRLGMD GLDYAFDQLK LRKLNADVLA
NNQRSIAFHE RLGFQREGVF RDGHLADSGP VDVIRYGILE NEWREQRPWV LACLEARAHR
TLPERSDLQQ YIVASCKAWH RPGFEALQRE TPGDWTWVSC PSELMAALEH QSPSYIFFLH
WSWLVPKDVW SRYECVCFHM TDVPYGRGGS PLQNLIAAGH TETKLSALRM LAQMDAGPVY
AKRPLHLNGR AEDIYLRAGA LSFELISWIV DQKPEPLEQQ GGPLTFKRRT PDQSVLPSQG
ALDKLYDHIR MLDAPGYPLA FIEHGAFRIC FSNAELKNGI LEARAQISKC QSTKGADT