Gene SeD_A2979 details


Gene Information

Locus tag: SeD_A2979
Symbol:
ID: 6872119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2868929
End bp: 2871589
Gene Length: 2661 bp
Protein Length: 886 aa
Translation table: 11
GC content: 56%
IMG OID: 642786016
Product: CoA-binding domain/acetyltransferase domain-containing protein
Protein accession: YP_002216666
Protein GI: 198241823
COG category: [C] Energy production and conversion
              [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming)
        [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins
TIGRFAM ID:
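
The start, end, gene length and protein length fields in the record above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2868929
end_bp = 2871589
gene_length_bp = 2661
protein_length_aa = 886


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under those assumptions, 2871589 - 2868929 + 1 gives 2661 bp, and 2661 / 3 - 1 gives 886 aa, so the four fields agree with one another.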


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGC AAGGACTGGA AGCGCTACTG CGGCCAAAAT CGATCGCGGT GATTGGCGCA 
TCAATGAAGC CCCACCGCGC GGGTTACCTG ATGATGCGTA ACTTGTTGGC GGGCGGATTC
AATGGCCCCG TTCTTCCCGT GACGCCCGCC TGGAAAGCCG TTTTAGGCGT CATGGCCTGG
CCGGATATCG CCAGTCTTCC TTTCACCCCC GATCTGGCTA TTTTATGCAC TAACGCCAGC
CGTAACCTGG CGTTACTGGA CGCGCTTGGC GCGAAAGGGT GTAAAACGTG CATTATCCTT
TCTGCTCCCA CGTCGCAACA TGAAGAACTT CTTGCCTGTG CCCGGCATTA TAAAATGCGT
CTGCTGGGTC CAAACAGTCT TGGGCTCCTC GCGCCGTGGC AAGGGCTGAA TGCCAGCTTT
TCTCCCGTCC CGATTAAACA GGGCAAGCTC GCTTTTATTT CCCAGTCTGC CGCCGTGTCC
AATACTATTC TTGACTGGGC GCAACAGCGT GAAATGGGCT TTTCCTACTT TATCGCGCTG
GGCGATAGCC TGGATATTGA TGTCGATGAA CTACTGGACT ATCTGGCGCG CGACAGCAAG
ACCAGCGCGA TTTTGCTCTA TCTGGAACAG TTAAGCGACG CCCGCCGTTT TGTTTCCGCC
GCCCGTAGCG CTTCACGTAA CAAACCGATT CTGGTGATTA AAAGCGGCCG AAGCCCGGCA
GCCCAGCGTT TACTTAATAC CAGCGCGGGA ATGGACCCTG CGTGGGATGC GGCCATCCAG
CGCGCAGGCC TGCTGCGAGT CCAGGATACG CACGAGCTTT TTTCCGCCGT CGAAACACTG
AGCCATATGC GTCCGCTACG CGGCGACAGA CTGATGATCA TCAGCAATGG CGCCGCGCCT
GCCGCGCTGG CGTTAGATGA GTTGTGGTCG CGTAACGGCA AGCTGGCGAC GTTGAGCGAA
GAGACCTGCC TGCAACTACG GCAGGCGCTT CCCGCGCACA TAGATATTGC CAATCCGCTG
GATCTGTGTG ATGACGCCAG CAGCGAACAT TACGTCAAAA CGCTGGATAT CCTGCTCGCC
AGTCAGGATT TTGACGCGCT TATGGTTATC CACTCTCCCA GCGCTGCCGC GCCGGGTACA
GAAAGCGCCC ATGCTCTGAT CGAGACGATT AAGCGCCACC CCAGAGGCAA GTTTGTTACG
CTGCTGACAA ACTGGTGCGG CGAGTTCTCG TCTCAGGAGG CAAGACGGCT ATTCAGCGAA
GCCGGATTAC CAACCTACCG TACGCCGGAA GGCACGATTA CCGCGTTTAT GCATATGGTG
GAATACCGGC GTAACCAGAA GCAACTGCGA GAAACGCCAG CGTTGCCGAG TAACCTGACG
TCCAATACCG CTGAGGCGCA TAATCTGTTA CAGCGGGCGA TTGCGGAAGG CGCCGCCTCA
CTGGATACCC ATGAAGTACA GCCGATTTTA CACGCCTATG GGCTGCACAC GCTCCCAACC
TGGATTGCCA GCGACAGCGC TGAAGCGGTG CATATCGCCG AACAGATAGG CTATCCGGTA
GCTCTCAAGC TGCGCTCGCC CGACATTCCG CATAAATCTG AAGTTCAGGG GGTCATGCTT
TACCTGCGGA CCGCAAGCGA GGTACAACAG GCCGCGAACG CCATTTTTGA TCGTGTAAAG
ATGGCCTGGC CGCAGGCGCG GATTCACGGT TTGCTGGTAC AAAGCATGGC TAACCGCGCC
GGCGCGCAGG AGCTTCGTGT GGTGGTCGAG CACGATCCGG TGTTTGGTCC TTTGATTATG
TTGGGTGAAG GCGGCGTAGA GTGGCGTCCG GAAGAGCAGG CGGTCGTCGC GCTGCCGCCG
CTCAACATGA ATCTGGCGCG CTATCTGGTG ATTCAGGGCA TTAAACAGCG GAAAATTCGC
GCCCGTAGCG CGCTGCGTCC GCTGGATATT GTCGGTTTAA GCCAATTGCT GGTCCAGGTT
TCAAACCTGA TTGTCGACTG CCCGGAAATT CAGCGTCTGG ATATCCATCC GCTGCTGGCT
TCCGCCAGTG AGTTTACCGC GCTGGATGTG ACGCTGGATA TTGCCCCGTT TGATGGCGAT
AACGAAAGTC GACTTGCGGT ACGCCCCTAT CCCCACCAGC TTGAAGAGTG GGTGGAGATG
AAAAACGGCG ATCGCTGCCT GTTCCGTCCT ATCCTGCCGG AAGATGAGCC CCAACTGCGA
CAATTCATCG CCCAGGTCAC CAAAGAGGAT CTTTACTACC GTTATTTCAG CGAGATCAAC
GAATTCACCC ATGAAGATTT AGCCAACATG ACGCAGATCG ACTACGATCG AGAAATGGCC
TTTGTGGCCG TGAGGCGGAT GGACAACGCT GAAGAGATCC TCGGCGTAAC GCGCGCGATC
TCCGATCCTG ACAACGTAGA TGCCGAGTTT GCCGTATTGG TGCGTTCAGA TCTCAAAGGG
TTGGGTTTAG GACGCCGTTT AATGGAGAAA TTGATTGCCT ATACTCGCGA TCACGGATTG
AAGCGGCTGA ACGGTATTAC GATGCCGAAC AATCGCGGCA TGGTCGCGTT GGCCAGAAAA
CTGGGATTTC AGGTCGATAT TCAGCTCGAC GAGGGCATCG TGGGATTGAC GCTGAATCTG
GCCAAATGTG ATGAATCGTG A
 
Protein sequence
MSQQGLEALL RPKSIAVIGA SMKPHRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVMAW 
PDIASLPFTP DLAILCTNAS RNLALLDALG AKGCKTCIIL SAPTSQHEEL LACARHYKMR
LLGPNSLGLL APWQGLNASF SPVPIKQGKL AFISQSAAVS NTILDWAQQR EMGFSYFIAL
GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA
AQRLLNTSAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP
AALALDELWS RNGKLATLSE ETCLQLRQAL PAHIDIANPL DLCDDASSEH YVKTLDILLA
SQDFDALMVI HSPSAAAPGT ESAHALIETI KRHPRGKFVT LLTNWCGEFS SQEARRLFSE
AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAEAHNLL QRAIAEGAAS
LDTHEVQPIL HAYGLHTLPT WIASDSAEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML
YLRTASEVQQ AANAIFDRVK MAWPQARIHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM
LGEGGVEWRP EEQAVVALPP LNMNLARYLV IQGIKQRKIR ARSALRPLDI VGLSQLLVQV
SNLIVDCPEI QRLDIHPLLA SASEFTALDV TLDIAPFDGD NESRLAVRPY PHQLEEWVEM
KNGDRCLFRP ILPEDEPQLR QFIAQVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA
FVAVRRMDNA EEILGVTRAI SDPDNVDAEF AVLVRSDLKG LGLGRRLMEK LIAYTRDHGL
KRLNGITMPN NRGMVALARK LGFQVDIQLD EGIVGLTLNL AKCDES
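
Both sequences above are printed in space-separated blocks of 10, so the whitespace has to be removed before they can be processed. The sketch below is a minimal illustration, not part of the record: it strips the spacing and translates the first and last lines' worth of the gene sequence so the result can be compared with the start and end of the protein sequence. The helper names `clean` and `translate` are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns amino acids to codons the same way and differs only in which start codons it permits.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 21 nt of the gene sequence, copied from this record.
gene_prefix = clean("ATGAGCCAGC AAGGACTGGA AGCGCTACTG")
gene_suffix = clean("GCCAAATGTG ATGAATCGTG A")

print(translate(gene_prefix))  # MSQQGLEALL
print(translate(gene_suffix))  # AKCDES
```

The two outputs match the first ten and last six residues of the protein sequence listed above. Running the same functions on the full gene sequence would be the natural next step.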