Gene SeHA_C3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3791 
Symbol 
ID6488524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3653266 
End bp3654543 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content57% 
IMG OID642743903 
Producthypothetical protein 
Protein accessionYP_002047509 
Protein GI194447601 
COG category[S] Function unknown 
COG ID[COG3266] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0131119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAAT TCAAACCAGA AGACGAGCTG AAACCCGATC CCAGCGATCG TCGTACTGGT 
CGTTCTCGTC AATCTTCAGA ACGCGATAAT GAGCCGCAGA TCAACTTTGA TGACGTTGAT
CTGGACGCCG ACGATCGCCG TCCGACGCGT ACGCGTAAAG CGCGTAGTGA AGAACCTGAA
GTTGAAGAAG AGTACGAATC CGATGAAGAC GATACGGTGG ACGAAGAGCG TGTTGAACGC
CGCCCACGTA AGCGTAAAAA AGCGGCCCAT AAGCCAGCCT CTCGTCAGTA CATGATGATG
GGCGTTGGCG TACTGGTGCT GCTGCTGTTG ATTATCGGTA TCGGCTCCGC GCTGAAAGCC
CCCTCAACGT CTTCCAGCGA GCCGTCGGCC TCTGGCGAAA AGAGTATCGA TCTTTCCGGT
AACGCCGCCG ATCAGGCGAA TGCGACCCAG CCTGCGCCAG GCGCCACCTC CGCAGAACAA
ACCGCGGGCA ATACGTCGCA GGATATTTCG TTGCCGCCGA TTTCTTCAAC GCCGACGCAG
GGACAGTCAC CTGTGGTCGC TGACGGTCAG CAGCGCGTGG AAGTGCAGGG CGATCTGAAT
AATGCGCTGA CGCAGAATCC AGAGCAGATG AATAATGTTG CGGTGAACTC TACGTTGCCG
ACAGAGCCTG CAACCGTCGC GCCAGTTCGC AATGGCAGCA CGACGCGTCA GGCGGCGGTT
AGCGAACCTG CCGAGCGTCA TACCACGCGT CCGGAACGTA AACAGGCCGT CATTGAACCT
AAGAAACCGC AGACCACGGC GAAAACCACC ACTGCGGAAC CGAAGAAACC GGTCGCGCCA
GTGAAACGCA CGGAACCGGC AGCGCCAGCC GCGACGCCGA AAGCGACCAC CACGACGGCT
GCGCCGACAG CGACGGCAAG CGCTGCGCCG GTACAAACCG CGAAGCCAGC GCAAGCCTCG
ACGACGCCTG TCGCAGGCGG CGGGAAAAGC GCCGGCAACG TTGGCGCATT AAAGAGCGCG
CCATCCAGCC ACTACACATT GCAGCTCAGT AGTTCTTCAA ATTACGACAA CCTGAACGGT
TGGGCGAAGA AAGAGAACCT GAAAAATTAT GTGGTATACG AGACGACGCG TAATGGACAA
CCGTGGTATG TGCTGGTAAC GGGGATGTAT GCTTCGAAAG AAGATGCTAA ACGTGCGGTG
TCCACCTTAC CTGCCGATGT GCAGGCGAAA AACCCGTGGG CAAAACCGTT GCATCAGGTT
CAGGCCGATC TGAAATAA
 
Protein sequence
MDEFKPEDEL KPDPSDRRTG RSRQSSERDN EPQINFDDVD LDADDRRPTR TRKARSEEPE 
VEEEYESDED DTVDEERVER RPRKRKKAAH KPASRQYMMM GVGVLVLLLL IIGIGSALKA
PSTSSSEPSA SGEKSIDLSG NAADQANATQ PAPGATSAEQ TAGNTSQDIS LPPISSTPTQ
GQSPVVADGQ QRVEVQGDLN NALTQNPEQM NNVAVNSTLP TEPATVAPVR NGSTTRQAAV
SEPAERHTTR PERKQAVIEP KKPQTTAKTT TAEPKKPVAP VKRTEPAAPA ATPKATTTTA
APTATASAAP VQTAKPAQAS TTPVAGGGKS AGNVGALKSA PSSHYTLQLS SSSNYDNLNG
WAKKENLKNY VVYETTRNGQ PWYVLVTGMY ASKEDAKRAV STLPADVQAK NPWAKPLHQV
QADLK