Gene SeHA_C2456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2456 
Symbol 
ID6491691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2367742 
End bp2369331 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content56% 
IMG OID642742639 
Productglutathione import ATP-binding protein GsiA 
Protein accessionYP_002046274 
Protein GI194450972 
COG category[R] General function prediction only 
COG ID[COG4172] ABC-type uncharacterized transport system, duplicated ATPase component 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00000644784 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACATATC CATTGCTTGC GATTGAAAAT TTATCGGTAG GTTTCCGTCA GCAACAGCAC 
GTGCGTCCTG TCGTTAACGC CATTTCGTTA CAGGTGAACG CCGGGGAAAC GCTGGCGCTG
GTCGGCGAGT CCGGGTCAGG AAAAAGCGTG ACGGCGCTTT CCATTCTGCG TCTTTTGCCT
ACCCCTCCCG CCGTCTACCT CTCCGGCGAT ATTCGCTTTC ACGGCGAATC ATTGCTTCAT
GCCAGCGAGC AGACGCTACG CGGCGTACGC GGCAATAAAA TCGCCATGAT TTTTCAGGAA
CCGATGGTTT CACTTAACCC GCTGCATACG CTGGAAAAAC AGCTCTATGA AGTGCTGTCG
CTCCATCGGG GAATGCGTCG GGAGGCGGCA AGAGCGGAGA TGATCGGTTG TCTGGATCGG
GTCGGTATCC GCCAGGCGAG CCAGCGTCTG CGCGATTACC CTCATCAGCT TTCCGGCGGC
GAACGCCAGC GCGTCATGAT AGCCATGGCG CTGTTAACAC GGCCGGAATT ACTGATCGCC
GATGAGCCGA CCACCGCCCT CGACGTCTCC GTACAGGCGC AGATTTTATC TCTACTACGG
GAACTCCAGC GCGAGCTTAA TATGGGATTA CTGTTTATCA CCCATAACCT CAGCATTGTA
AAAAAACTGG CGGATTCAGT AGCGGTAATG CAACACGGCA AGTGCGTAGA GAACCAACGC
GCCGACACGC TGCTCTCCGC GCCGACCCAT CCGTACACGC AAAAACTACT CAACAGCGAA
CCCACAGGCG ATCCGGTTCC GCTCCCCGCC GGGCAGACGC CGTTGCTGGA GGTGGACAGG
CTGCGCGTCG CCTTCCCGAT CCGCAAAGGC ATTCTGAAGC GCGTCGTGGA TCATAATGTG
GTGGTTAACA ATATCAGTTT CACCCTGCAT CCAGGCGAAA CGCTGGGTCT GGTCGGCGAG
TCAGGATCGG GAAAAAGCAC CACCGGTCTG GCGCTGTTAC GGCTTATCCG CTCCGAAGGC
CGCATCGTGT TTGACGGTCA ATCGCTGGAT ACGTTAAACC GCCACCAGCT TTTACCTGTT
CGCCACCGTA TCCAGGTCGT ATTCCAGGAC CCGAACTCAT CGCTAAACCC GCGTTTAAAC
GTATTGCAAA TTATCGAAGA AGGCCTGCGC GTCCACCAGC CTACGCTTTC AGGCGCGCAG
CGCGAACAGC AGGTGAAAGC GGTCATGATG GAAGTCGGCC TGGACCCGGA AACGCGGCAT
CGTTACCCCG CTGAGTTTTC CGGCGGCCAG CGTCAACGTA TCGCCGTCGC CAGGGCACTG
ATTTTAAAAC CGTCGCTTAT TATTCTGGAT GAACCGACCT CATCGCTGGA TAAAACCGTT
CAGGCGCAGA TTCTTGCCCT CCTGAAATCG CTCCAGCAAA AGCACCGTCT GGCCTATATC
TTCATTAGCC ACGATCTGCA TGTAGTACGC GCGCTGTGCC ATCAGGTTAT TGTGCTGCGG
CAGGGGGAGG TGGTTGAACA GGGGCAATGC GAGCGCGTGT TTACCGCACC GCAACAGGCC
TATACGCGTC AGCTACTCGC GTTAAGCTGA
 
Protein sequence
MTYPLLAIEN LSVGFRQQQH VRPVVNAISL QVNAGETLAL VGESGSGKSV TALSILRLLP 
TPPAVYLSGD IRFHGESLLH ASEQTLRGVR GNKIAMIFQE PMVSLNPLHT LEKQLYEVLS
LHRGMRREAA RAEMIGCLDR VGIRQASQRL RDYPHQLSGG ERQRVMIAMA LLTRPELLIA
DEPTTALDVS VQAQILSLLR ELQRELNMGL LFITHNLSIV KKLADSVAVM QHGKCVENQR
ADTLLSAPTH PYTQKLLNSE PTGDPVPLPA GQTPLLEVDR LRVAFPIRKG ILKRVVDHNV
VVNNISFTLH PGETLGLVGE SGSGKSTTGL ALLRLIRSEG RIVFDGQSLD TLNRHQLLPV
RHRIQVVFQD PNSSLNPRLN VLQIIEEGLR VHQPTLSGAQ REQQVKAVMM EVGLDPETRH
RYPAEFSGGQ RQRIAVARAL ILKPSLIILD EPTSSLDKTV QAQILALLKS LQQKHRLAYI
FISHDLHVVR ALCHQVIVLR QGEVVEQGQC ERVFTAPQQA YTRQLLALS