Gene SeHA_C3065 details

Gene Information

Locus tag: SeHA_C3065
Symbol:
ID: 6488587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2994370
End bp: 2996031
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 43%
IMG OID: 642743220
Product: invasion protein regulator
Protein accession: YP_002046839
Protein GI: 194450254
COG category: [K] Transcription
COG ID: [COG3710] DNA-binding winged-HTH domains
TIGRFAM ID:
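
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2994370
end_bp = 2996031

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS is read in whole codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1662 bp
print(protein_length, "aa")  # 553 aa
```

Under these assumptions the computed values agree with the listed Gene Length (1662 bp) and Protein Length (553 aa).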


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.359189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA 
CTCAACATGG ACGGCTCCCT GCTACGCTCA GAAAAGAAAG TCAATATTCC GCCAAAAGAA
TATGCCGTTC TGGTCATCCT GCTCGAAGCC GCCGGCGAGA TTGTGAGTAA AAACACCTTA
CTGGACCAGG TATGGGGCGA CGCGGAAGTT AACGAAGAAT CTCTTACCCG CTGTATTTAT
GCCTTACGAC GTATTCTGTC GGAAGATAAA GAGCATCGTT ACATTGAAAC ACTGTACGGA
CAGGGCTATC GGTTTAATCG TCCGGTCGTA GTGGTGTCTC CGCCAGCGCC GCAACCTACG
ACTCATACAT TGGCGATACT TCCTTTTCAG ATGCAGGATC AGGTTCAATC CGAGAGTCTG
CATTACTCTA TCGTGAAGGG ATTATCGCAG TATGCGCCCT TTGGCCTGAG CGTGCTGCCG
GTGACCATTA CGAAGAACTG CCGCAGTGTT AAGGATATTC TTGAGCTCAT GGATCAATTA
CGCCCCGATT ATTATATCTC CGGGCAGATG ATACCCGATG GTAATGATAA TATTGTACAG
ATTGAGATAG TTCGGGTTAA AGGTTATCAC CTGCTGCACC AGGAAAGCAT TAAGTTGATA
GAACACCAAC CCGCTTCTCT CTTGCAAAAC AAAATTGCGA ATCTTTTGCT CAGATGTATT
CCCGGACTTC GCTGGGACAC AAAGCAGATT AGCGAGCTAA ATTCGATTGA CAGTACTATG
GTTTACTTAC GCGGTAAGCA TGAGTTAAAT CAATACACCC CCTATAGCTT ACAGCAAGCG
CTTAAATTGC TGACTCAATG CGTTAACATG TCGCCAAACA GCATTGCGCC TTACTGTGCG
CTGGCAGAAT GCTACCTCAG CATGGCGCAA ATGGGGATTT TTGATAAACA AAACGCTATG
ATCAAAGCTA AAGAACATGC GATTAAGGCG ACAGAGCTGG ACCACAATAA TCCACAAGCT
TTAGGATTAC TGGGGCTAAT TAATACGATT CACTCAGAAT ACATCGTCGG GAGTTTGCTA
TTCAAACAAG CTAACTTACT TTCGCCCATT TCTGCAGATA TTAAATATTA TTATGGCTGG
AATCTTTTCA TGGCTGGTCA GTTGGAGGAG GCCTTACAAA CGATTAACGA GTGTTTAAAA
TTGGACCCAA CGCGCGCAGC CGCAGGGATC ACTAAGCTGT GGATTACCTA TTATCATACC
GGTATTGATG ATGCTATACG TTTAGGCGAT GAATTACGCT CACAACACCT GCAGGATAAT
CCAATATTAT TAAGTATGCA GGTTATGTTT CTTTCGCTTA AAGGTAAACA TGAACTGGCA
CGAAAATTAA CTAAAGAAAT ATCCACGCAG GAAATAACAG GACTTATTGC TGTTAATCTT
CTTTACGCTG AATATTGTCA GAATAGTGAG CGTGCCTTAC CGACGATAAG AGAATTTCTG
GAAAGTGAAC AGCGTATAGA TAATAATCCG GGATTATTAC CGTTAGTGCT GGTTGCCCAC
GGCGAAGCTA TTGCCGAGAA AATGTGGAAT AAATTTAAAA ACGAAGACAA TATTTGGTTC
AAAAGATGGA AACAGGATCC CCGCTTGATT AAATTACGGT AA
 
Protein sequence
MPHFNPVPVS NKKFVFDDFI LNMDGSLLRS EKKVNIPPKE YAVLVILLEA AGEIVSKNTL 
LDQVWGDAEV NEESLTRCIY ALRRILSEDK EHRYIETLYG QGYRFNRPVV VVSPPAPQPT
THTLAILPFQ MQDQVQSESL HYSIVKGLSQ YAPFGLSVLP VTITKNCRSV KDILELMDQL
RPDYYISGQM IPDGNDNIVQ IEIVRVKGYH LLHQESIKLI EHQPASLLQN KIANLLLRCI
PGLRWDTKQI SELNSIDSTM VYLRGKHELN QYTPYSLQQA LKLLTQCVNM SPNSIAPYCA
LAECYLSMAQ MGIFDKQNAM IKAKEHAIKA TELDHNNPQA LGLLGLINTI HSEYIVGSLL
FKQANLLSPI SADIKYYYGW NLFMAGQLEE ALQTINECLK LDPTRAAAGI TKLWITYYHT
GIDDAIRLGD ELRSQHLQDN PILLSMQVMF LSLKGKHELA RKLTKEISTQ EITGLIAVNL
LYAEYCQNSE RALPTIREFL ESEQRIDNNP GLLPLVLVAH GEAIAEKMWN KFKNEDNIWF
KRWKQDPRLI KLR
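
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that mapping, not the tool used to produce this record. The `translate` function and constant names are hypothetical, and the codon table is the standard amino acid assignment, which table 11 shares (table 11 differs only in its permitted start codons). Only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCACATT TTAATCCTGT TCCTGTATCG AATAAAAAAT TCGTCTTTGA TGATTTCATA"
print(translate(first_line))  # MPHFNPVPVSNKKFVFDDFI
```

The output matches the first 20 residues of the protein sequence. The last four codons of the gene sequence (AAA TTA CGG TAA) translate the same way to KLR followed by a stop, consistent with the end of the protein.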