Gene SeD_A4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4290 
Symbol 
ID6871663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4135457 
End bp4136977 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content53% 
IMG OID642787219 
Productputative ATP-dependent protease 
Protein accessionYP_002217839 
Protein GI198242242 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0606] Predicted ATPase with chaperone activity 
TIGRFAM ID[TIGR00368] Mg chelatase-related protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGG CGATTGTTCA TACTCGCGCC GCACTTGGCG TCAATGCGCC GCCTATCACT 
ATAGAGGTGC ATATCAGTAA TGGATTACCG GGATTAACCA TGGTTGGCCT ACCGGAAACC
ACGGTAAAAG AGGCGCGTGA CCGGGTACGT AGTGCGATTA TTAATAGCGG ATATGAATTT
CCGGCTAAAA AAATAACCAT TAACCTTGCT CCAGCCGATC TACCGAAAGA GGGCGGAAGG
TATGACCTGC CTATTGCTGT TGCGCTTCTG GCCGCGTCTG AGCAGCTTAC AGCGTCGAAT
CTTGAGGCAT ATGAGCTGGT GGGTGAGTTA GCGCTTACAG GCGCATTACG CGGCGTTCCT
GGCGCAATAT CAAGTGCAAC GGAAGCCATC AGGGCCGGCA GAAATATTAT CGTCGCAACA
GAGAACGCGG CGGAGGTTGG GCTTATCAGC AAAGAAGGAT GTTTTATCGC CGATCATCTA
CAAACCGTCT GCGCCTTTCT GGAAGGGAAA CACGCCCTGG AAAGACCTTT AGCTCAGGAT
ATGGCATCGT CTACCGCAAC TGCCGATCTT CGCGATGTGA TCGGTCAGGA GCAGGGTAAA
CGCGGCCTGG AGATTACAGC GGCAGGGGGA CATAATCTGC TATTGATCGG CCCGCCGGGT
ACGGGTAAAA CCATGCTGGC CAGTCGACTG AGCGGGATTC TTCCACCATT AAGCAATGAA
GAGGCGTTGG AAAGCGCCGC GATCCTCAGT CTGGTTAATG CCGATACGGT ACAAAAACGA
TGGCAGCAAC GTCCCTTTCG CTCACCTCAT CATAGCGCCT CACTTACTGC TATGGTCGGC
GGCGGCGCAA TACCCGCCCC GGGAGAGATA TCGCTGGCGC ACAACGGAAT TTTGTTCCTT
GATGAATTGC CTGAATTTGA ACGACGCACA CTGGATGCGC TACGTGAACC TATAGAATCC
GGTCAAATTC ATTTATCCCG TACCAGAGCG AAAATAACGT ACCCTGCCAG GTTCCAGTTA
ATCGCCGCAA TGAATCCCAG CCCGACCGGA CATTATCAGG GAAACCATAA TCGCTGCACG
CCAGAACAGA CACTACGTTA CCTTAATCGG TTGTCAGGCC CGTTTCTTGA TCGTTTTGAC
CTTTCGCTTG AGATACCGCT TCCACCGCCC GGGATTCTTA GCCAACACGC CTCAAAGGGT
GAGAGCAGCG CTACGGTAAA AAAGCGGGTC ATCGCCGCCC ATGAACGGCA GTACCGACGC
CAGAAGAAGT TAAACGCGCG TCTGGAGGGT CGCGAAATCC AAAAATATTG TGTTTTGCAC
CACGATGACG CCCGCTGGCT TGAAGACACG CTGGTGCATC TTGGATTATC CATTCGCGCC
TGGCAGCGTT TACTAAAAGT GGCCAGAACC ATTGCCGACA TAGAACTGGC TGACCAGATC
TCGCGTCAGC ATTTGCAGGA GGTGGTAAGC TATCGGGCGA TAGACAGGTT GTTAATTCAT
TTGCAAAAGC TGCTGGCGTA A
 
Protein sequence
MSLAIVHTRA ALGVNAPPIT IEVHISNGLP GLTMVGLPET TVKEARDRVR SAIINSGYEF 
PAKKITINLA PADLPKEGGR YDLPIAVALL AASEQLTASN LEAYELVGEL ALTGALRGVP
GAISSATEAI RAGRNIIVAT ENAAEVGLIS KEGCFIADHL QTVCAFLEGK HALERPLAQD
MASSTATADL RDVIGQEQGK RGLEITAAGG HNLLLIGPPG TGKTMLASRL SGILPPLSNE
EALESAAILS LVNADTVQKR WQQRPFRSPH HSASLTAMVG GGAIPAPGEI SLAHNGILFL
DELPEFERRT LDALREPIES GQIHLSRTRA KITYPARFQL IAAMNPSPTG HYQGNHNRCT
PEQTLRYLNR LSGPFLDRFD LSLEIPLPPP GILSQHASKG ESSATVKKRV IAAHERQYRR
QKKLNARLEG REIQKYCVLH HDDARWLEDT LVHLGLSIRA WQRLLKVART IADIELADQI
SRQHLQEVVS YRAIDRLLIH LQKLLA