Gene SeHA_C4991 details


Gene Information

Locus tag: SeHA_C4991
Symbol:
ID: 6490030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4868495
End bp: 4870162
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 58%
IMG OID: 642745033
Product: putative ABC transporter ATP-binding protein
Protein accession: YP_002048602
Protein GI: 194448109
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
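
The coordinate and length fields in this record agree with one another, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 4868495
end_bp = 4870162

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final (stop) codon does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1668 bp
print(protein_length, "aa")  # 555 aa
```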


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.170616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TGGTTCCGCC GAAACGTCAT 
ATTCTGAAAA ACATCTCGCT GAGCTTCTTC CCTGGCGCCA AAATCGGCGT GCTGGGCCTT
AACGGCGCCG GTAAGTCTAC CCTGCTGCGC ATCATGGCGG GTCTCGATAA AGATATCGAG
GGCGAAGCGC GCCCGCAGCC CGGCATTAAG ATTGGCTACC TGCCGCAGGA ACCTCAGCTA
AACCCGGAAC ACACGGTACG CGAGTCGATT GAAGAGGCCG TTTCGGAAGT GGTTAACGCC
CTCAAACGTC TGGATGAAGT GTACGCGCTG TACGCCGATC CGGATGCCGA CTTCGACAAG
CTGGCCGCAG AGCAGGGCCG GCTTGAAGAG ATTATCCAGG CGCACGACGG TCATAATCTG
AACGTGCAGC TTGAGCGCGC TGCTGACGCC CTGCGTCTGC CGGACTGGGA TGCCAAAGTC
GAAAAACTGT CCGGCGGCGA GCGCCGCCGC GTGGCGCTGT GCCGTCTGTT GCTGGAAAAG
CCGGACATGC TGCTGCTCGA CGAACCCACC AACCACCTGG ATGCCGAATC TGTTGCGTGG
CTGGAACGTT TCCTGCACGA CTTCGAAGGC ACCGTCGTGG CGATTACCCA CGACCGTTAC
TTCCTCGATA ACGTCGCCGG CTGGATTCTG GAACTTGACC GCGGCGAAGG TATTCCGTGG
GAAGGCAACT ACTCCTCCTG GCTGGAGCAG AAAGATCAGC GTCTGGCGCA GGAAGCGTCT
GCCGAAGCGG CGCGCCGTAA ATCCATTGAG AAAGAGCTGG AGTGGGTGCG TCAGGGCGCG
AAAGGCCGTC AGTCGAAAGG TAAGGCGCGT CTGGCTCGCT TTGAAGAACT GAACAGCGTT
GAGTATCAGA AACGTAACGA AACCAACGAA CTGTTTATTC CACCAGGACC GCGTCTGGGC
GACAAAGTCA TTGAAGTCAG CAACCTGCGT AAATCCTACG GCGACCGCGT ACTGATCGAC
GACCTGAGCT TCTCGGTGCC GAAAGGCGCT ATCGTCGGGA TCATCGGGCC AAACGGCGCG
GGTAAATCGA CCCTGTTCCG CATGATGTCC GGTCAGGAGC AGCCTGATAG CGGCACCATT
ACGCTGGGTG AAACCGTCAA GCTGGCCTCG GTCGATCAGT TCCGCGACGC AATGGACAAC
AGCAAAACCG TCTGGGAAGA AGTGTCCGGC GGGCTGGATA TCATGAGGAT CGGCAACACT
GAAATGCCAA GCCGCGCCTA TGTAGGCCGC TTCAACTTCA AAGGCGTCGA TCAGGGCAAA
CGCGTTGGCG AACTGTCCGG CGGTGAGCGT GGTCGTTTGC ATCTGGCGAA GCTGCTGCAG
GTGGGCGGCA ACGTCCTGCT GCTTGACGAA CCGACGAACG ACCTGGATAT CGAAACCCTG
CGCGCGCTGG AAAACGCCCT GCTGGAGTTC CCTGGCTGCG CGATGGTTAT CTCGCACGAC
CGTTGGTTCC TCGACCGTAT CGCCACCCAC ATTCTGGATT ATCAGGATGA AGGTAAGGTG
GAATTCTTCG AAGGCAACTT TACCGAATAC GAAGAGTACA AGAAACGCAC GCTGGGCGCC
GAGGCGCTGG AGCCGAAGCG TATCAAGTAC AAGCGTATTG CCAAATAA
 
Protein sequence
MAQFVYTMHR VGKVVPPKRH ILKNISLSFF PGAKIGVLGL NGAGKSTLLR IMAGLDKDIE 
GEARPQPGIK IGYLPQEPQL NPEHTVRESI EEAVSEVVNA LKRLDEVYAL YADPDADFDK
LAAEQGRLEE IIQAHDGHNL NVQLERAADA LRLPDWDAKV EKLSGGERRR VALCRLLLEK
PDMLLLDEPT NHLDAESVAW LERFLHDFEG TVVAITHDRY FLDNVAGWIL ELDRGEGIPW
EGNYSSWLEQ KDQRLAQEAS AEAARRKSIE KELEWVRQGA KGRQSKGKAR LARFEELNSV
EYQKRNETNE LFIPPGPRLG DKVIEVSNLR KSYGDRVLID DLSFSVPKGA IVGIIGPNGA
GKSTLFRMMS GQEQPDSGTI TLGETVKLAS VDQFRDAMDN SKTVWEEVSG GLDIMRIGNT
EMPSRAYVGR FNFKGVDQGK RVGELSGGER GRLHLAKLLQ VGGNVLLLDE PTNDLDIETL
RALENALLEF PGCAMVISHD RWFLDRIATH ILDYQDEGKV EFFEGNFTEY EEYKKRTLGA
EALEPKRIKY KRIAK
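
The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The following is a rough Python sketch of that behaviour, not the database's own method. It assumes that table 11 uses the standard codon assignments for internal codons and treats only ATG, GTG, and TTG as start codons (a subset chosen for illustration). The function name `translate_bacterial` is hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of the initiation codons used in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_bacterial(dna):
    """Translate a coding sequence, reading a leading start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TGGTTCCGCC GAAACGTCAT"
print(translate_bacterial(first_line))  # MAQFVYTMHRVGKVVPPKRH
```

The output matches the first twenty residues of the protein sequence listed above (MAQFVYTMHR VGKVVPPKRH).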