Gene SeHA_C4231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4231 
Symbol 
ID6488236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4116752 
End bp4118272 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content53% 
IMG OID642744324 
Productputative ATP-dependent protease 
Protein accessionYP_002047922 
Protein GI194448893 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0606] Predicted ATPase with chaperone activity 
TIGRFAM ID[TIGR00368] Mg chelatase-related protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGG CGATTGTTCA TACCCGCGCC GCACTTGGCG TCAATGCGCC GCCTATCACT 
ATAGAGGTGC ATATCAGTAA TGGATTACCG GGATTAACCA TGGTTGGCCT ACCGGAAACC
ACGGTAAAAG AGGCGCGTGA CCGGGTACGT AGTGCGATTA TTAATAGCGG ATATGAATTT
CCGGCTAAAA AAATAACCAT TAACCTTGCT CCAGCCGATC TACCGAAAGA GGGCGGAAGG
TATGACCTGC CTATTGCTGT TGCGCTTCTG GCCGCGTCTG AGCAGCTTAC AGCGTCGAAT
CTTGAGGCAT ATGAGCTGGT GGGTGAGTTA GCGCTTACGG GCGCATTACG CGGCGTTCCT
GGCGCAATAT CAAGTGCAAC GGAAGCCATC AAGGCCGGCA GAAATATTAT CGTCGCAACA
GAGAACGCGG CGGAGGTTGG GCTTATCAGC AAAGAAGGTT GTTTTATCGC CGATCATCTA
CAAACCGTCT GCGCCTTTCT GGAAGGGAAA CACGCCCTGG AAAGACCTTT AGCTCAGGAT
ATGGCATCGC CTACCGCAAC TGCCGATCTT CGCGATGTGA TCGGTCAGGA GCAGGGTAAA
CGCGGCCTGG AGATTACAGC GGCAGGGGGA CATAATCTGC TATTGATCGG CCCGCCGGGT
ACGGGTAAAA CCATGCTGGC CAGTCGACTG AGCGGGATTC TTCCACCATT AAGCAATGAA
GAGGCGTTGG AAAGCGCCGC GATCCTCAGT CTGGTTAATG CCGATACGGT ACAAAAACGA
TGGCAGCAAC GCCCCTTTCG CTCACCTCAT CATAGCGCCT CACTTACTGC TATGGTCGGC
GGCGGCGCAA TACCCGCCCC GGGAGAGATA TCGCTGGCGC ACAACGGAAT TTTGTTCCTT
GATGAATTGC CTGAATTTGA GCGACGCACA CTGGATGCGC TACGTGAACC TATAGAATCC
GGTCAAATTC ATTTATCCCG TACCAGAGCG AAAATAACGT ACCCTGCCAG GTTCCAGTTA
ATCGCCGCAA TGAATCCCAG CCCGACCGGA CATTATCAGG GAAACCATAA TCGCTGCACG
CCAGAACAGA CACTACGTTA CCTTAATAGG TTGTCAGGCC CGTTTCTTGA TCGTTTTGAC
CTTTCGCTTG AGATACCGCT TCCTCCGCCC GGGATTCTTA GCCAACACGC CTCAAAGGGT
GAGAACAGCG CTACGGTAAA AAAGCGGGTC ATCGCCGCCC ATGAACGGCA GTACCGACGC
CAGAAGAAGT TAAACGCGCG TCTGGAGGGT CGCGAAATCC AAAAATACTG TGTTTTGCAT
CACGATGACG CCCGCTGGCT TGAAGACACG CTGGTGCATC TTGGATTATC CATTCGCGCC
TGGCAGCGTT TACTAAAAGT GGCCAGAACC ATTGCCGACA TAGAACTGGC TGACCAGATC
TCGCGTCAGC ATTTGCAGGA GGCGGTAAGC TATCGGGCGA TAGACAGGTT GTTAATTCAT
TTGCAAAAGC TGCTGGCGTA A
 
Protein sequence
MSLAIVHTRA ALGVNAPPIT IEVHISNGLP GLTMVGLPET TVKEARDRVR SAIINSGYEF 
PAKKITINLA PADLPKEGGR YDLPIAVALL AASEQLTASN LEAYELVGEL ALTGALRGVP
GAISSATEAI KAGRNIIVAT ENAAEVGLIS KEGCFIADHL QTVCAFLEGK HALERPLAQD
MASPTATADL RDVIGQEQGK RGLEITAAGG HNLLLIGPPG TGKTMLASRL SGILPPLSNE
EALESAAILS LVNADTVQKR WQQRPFRSPH HSASLTAMVG GGAIPAPGEI SLAHNGILFL
DELPEFERRT LDALREPIES GQIHLSRTRA KITYPARFQL IAAMNPSPTG HYQGNHNRCT
PEQTLRYLNR LSGPFLDRFD LSLEIPLPPP GILSQHASKG ENSATVKKRV IAAHERQYRR
QKKLNARLEG REIQKYCVLH HDDARWLEDT LVHLGLSIRA WQRLLKVART IADIELADQI
SRQHLQEAVS YRAIDRLLIH LQKLLA