Gene SeD_A1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1994 
Symbol 
ID6874414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1925808 
End bp1927448 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content53% 
IMG OID642785110 
Productshort chain acyl-CoA synthetase 
Protein accessionYP_002215776 
Protein GI198243624 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.194812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.059614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTTA CATTAACGTT TGACGCCGCG CGGCGGAAAA CCTATCGCGA GTCCGGCTAC 
TGGGGCGATG CTTCACTGGG CGACTACTGG CGGCAAACCG CACGCGCCGT ACCCGATAAA
ATCGCCGTGG TCGATAATCA TGGCGCGTCC TGGACCTACG CTGCGCTCGA CTGCGCGGCA
AGCCGCCTGG CAAACTGGTT ACTGTCTCAG GGGATTCAAC CGGGCGATCG CGTGGCCTTT
CAGCTTCCGG GCTGGTGTGA GTTTACCCTC ATTTATCTGG CCTGCCTGAA AACAGGCGCG
GTATCTGTCC CGCTACTTCC CGCCTGGCGA GAAGCAGAGC TGGTTTGGGT GCTGAACAAA
TGTCAGGCTA AAATATTCTT CGCCCCCACC GTATTCAAAC AGAATCGTCC GGTCGATCTT
ATCCTTCCTC TACAAAATCA ACTGCGCCAT CTGACGCATA TTGTCGGCGT GGATAAACTG
GCGCCTGCCA CCACAGCGCT TGCGCTTAGT CAAATCATCG ACCGCAGCGA ACCGTTGCAG
TCAGACATTA ACATTCACGG TGATGAGCTG GCGGCAGTGC TGTTTACCTC CGGCACAGAA
GGAATGCCGA AAGGGGTGAT GTTGACCCAC AATAATATTC TTGCCAGCGA ACGGGCGTAT
TGCGCGCGGT TGAATTTAAC CTGGCAAGAT GTGTTCCTGA TGCCTGCGCC ACTGGGCCAC
GCCACCGGAT TTTTACACGG CGTCACCGCG CCCTTTTTAA TCGGGGCGCG TAGCGTATTG
CTGGACATCT TTACCCCAGA AGCCTGCCTT ACCTTATTAG CGCAGCAACG CTGTACCTGT
ATGTCAGGCG CGACGCCGTT TATTTACGAT CTGCTTTGTG CCGTTGAGCA ACAGCCTGCC
GACCTCTCCT CACTGCGATT CTTCTTATGC GGCGGCACGA CTATTCCCAA AAAGGTTGCC
CGCGACTGCC AGCAGCGCGG TATCAAATTA TTGAGTATTT ACGGTTCTAC AGAAAGTTCT
CCACACTCGA TGGTTAATCT GGGTGATTCG ACTTCACGCA TGATGAACAC CGATGGTTAT
GCCGCCACCG GTGTAGAAAT TAAAATCGTG GATGAAGATC GCAATACGCT TCCAGCAGGC
CACGAAGGCG AAGAAGCCTC GCGCGGGCCG AACGTTTTTA TGGGATACCT GGACGAGCCG
GAACTTACCG CCCGCGCACT GGATAATGAG GGCTGGTATT ACAGCGGCGA CCTCTGCCGC
ATGGATGAAG ACGGTTATAT CAAAATTACC GGGCGCAAAA AGGATATTAT TATACGTGGA
GGTGAAAATA TCAGCAGTCG CGAGGTGGAG GATATTTTAT TGCAACACCC CCGCATACAC
GATGCTTGCG TCGTTGCTAT GCCTGATGAA CGCTTAGGCG AACGTTCATG CGCGTATGTA
GTCTTAAAAC CACCGCACCT TTCGCTGACG CTGGAAGAGG TGATCGCTTT TTTCAGTCGA
AAACGGGTAG CGAAGTATAA ATATCCGGAA CGGATCGTCA TCGTAGAAAA ACTGCCCAGA
ACCGCCTCTG GCAAAGTGCA GAAGTTTCTG TTGCGGCAGG ATATTATTGA GCGGCTGCGC
CAGGAGCACA CTGCGGTATA A
 
Protein sequence
MSVTLTFDAA RRKTYRESGY WGDASLGDYW RQTARAVPDK IAVVDNHGAS WTYAALDCAA 
SRLANWLLSQ GIQPGDRVAF QLPGWCEFTL IYLACLKTGA VSVPLLPAWR EAELVWVLNK
CQAKIFFAPT VFKQNRPVDL ILPLQNQLRH LTHIVGVDKL APATTALALS QIIDRSEPLQ
SDINIHGDEL AAVLFTSGTE GMPKGVMLTH NNILASERAY CARLNLTWQD VFLMPAPLGH
ATGFLHGVTA PFLIGARSVL LDIFTPEACL TLLAQQRCTC MSGATPFIYD LLCAVEQQPA
DLSSLRFFLC GGTTIPKKVA RDCQQRGIKL LSIYGSTESS PHSMVNLGDS TSRMMNTDGY
AATGVEIKIV DEDRNTLPAG HEGEEASRGP NVFMGYLDEP ELTARALDNE GWYYSGDLCR
MDEDGYIKIT GRKKDIIIRG GENISSREVE DILLQHPRIH DACVVAMPDE RLGERSCAYV
VLKPPHLSLT LEEVIAFFSR KRVAKYKYPE RIVIVEKLPR TASGKVQKFL LRQDIIERLR
QEHTAV