Gene SeHA_C4082 details


Gene Information

Locus tag: SeHA_C4082
Symbol:
ID: 6490894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3964743
End bp: 3965780
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 51%
IMG OID: 642744180
Product: virulence protein
Protein accession: YP_002047784
Protein GI: 194450883
COG category: [R] General function prediction only
COG ID: [COG3943] Virulence protein
TIGRFAM ID:
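
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3964743
end_bp = 3965780

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under those assumptions the computed values agree with the listed Gene Length (1038 bp) and Protein Length (345 aa).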


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.276809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGACA AATACTTAAC CCAATCCCCG GCAGGCGAAT TTGTTATGTT TGCCAGCGAT 
GACGGTGAAG TTCGTGTGGA GTGCCGCTTT GAGCAAGAGA CGCTATGGCT CCCTCAGGCA
ACCATCGCCA ACCTTTATCA GATCACTCCC CAGGCAGTTA CACAGCACAT TAAAGCGATC
TATGAAGAAG GCGAACTTGA GCAAAACGCA ACCTGTAAGT CTTACTTACA AGTTCAACAG
GAAGGTAGCC GTCAGGTAAG CCGCAACAGG CTTCACTACA GCCTGCCTGT CATCCTTGCT
GTCGGCTACC GCGTTCGTTC CCCGCGCGGC ACACAGTTCC GCCAGTGGGC AACCCAGACG
CTCCAGAAAT ACCTGATCAA AGGTTTTGTG ATGGACGATG AGCGCCTGAA AAATCCGCCC
GTGGGTTCAT CGGCTGTACC CGACTATTTT GATGAGATGC TGGAGCGTAT CCGCGATATT
CGCGCCAGCG AACGTCGGGT TTATTTGCGG GTACGAGAGA TCTTTGCGTT AGCCGCCGAC
TATCAACCAT CGCTCAAAGA AACCACGCAA TTTTTTCAAA CCATCCAGAA CAAGTTGCAT
TTTGCCTGTA CCGGACATAC CGCTGCTGAA CTCATTCATC AGCGTGCTGA CGCCAGCCAG
CCGCATATGG GGCTGACCAG CTATAAAGGT GAAGAGGTAC GTAAGGATGA CGTGACGGTG
GCAAAAAATT ATCTCACTCA GGATGAAGTC AGCGAGCTTA ACCGCGTAGT TAACATGTGG
CTGGATTTTG CCGAGGATCA GGCCCGTCGT CGTCAGCAGA TCTTTTTACG CGACTGGCAG
GATAAGCTGG ATCAGTTCCT GCAATTTAAC GACCGTGAGG TTTTACAAGG CGCAGGTAAA
GTCACTAAGA AAATGGCCGA TGAAAAAGCG CAGGCGGAAT ATAGTCAGTT TGCTGAACAA
CAACGGCGCT TAAAAGAAGC CGAAGGTGAG AAGGATATCG CCGGTTTGCT ACAATGGGAA
ACAGAACCTA AAAAGTAG
 
Protein sequence
MADKYLTQSP AGEFVMFASD DGEVRVECRF EQETLWLPQA TIANLYQITP QAVTQHIKAI 
YEEGELEQNA TCKSYLQVQQ EGSRQVSRNR LHYSLPVILA VGYRVRSPRG TQFRQWATQT
LQKYLIKGFV MDDERLKNPP VGSSAVPDYF DEMLERIRDI RASERRVYLR VREIFALAAD
YQPSLKETTQ FFQTIQNKLH FACTGHTAAE LIHQRADASQ PHMGLTSYKG EEVRKDDVTV
AKNYLTQDEV SELNRVVNMW LDFAEDQARR RQQIFLRDWQ DKLDQFLQFN DREVLQGAGK
VTKKMADEKA QAEYSQFAEQ QRRLKEAEGE KDIAGLLQWE TEPKK
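
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the method used to generate the record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two differ only in the permitted start codons, which does not matter here because this gene starts with ATG).

```python
# Hypothetical helpers for illustration; not part of the source record.
BASES = "TCAG"
# Amino-acid assignments in NCBI order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGCAGACA AATACTTAAC CCAATCCCCG"
print(translate(first_block))  # MADKYLTQSP
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the listed protein sequence and with the GC content field.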