Gene SeHA_C1954 details


Gene Information

Locus tag: SeHA_C1954
Symbol:
ID: 6487610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1901530
End bp: 1903059
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 49%
IMG OID: 642742159
Product: tetratricopeptide repeat protein
Protein accession: YP_002045802
Protein GI: 194448382
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
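
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = span_length(1901530, 1903059)
print(gene_len)                  # 1530
print(protein_length(gene_len))  # 509
```

Under those assumptions the listed values agree: the span covers 1530 bp, which is 510 codons, or 509 residues plus one stop codon.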


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTTT CATCAATTTT CAAAATAATA ATATTAAATA TATTTATTTC AAAAATGGCT 
CACGCTGCTG TCTGCGAGGA GCGCTATCCC GCCGACTCAG AAGAATGCCA GTACGTTCAG
GAATTAGAGC AAAAAGCGGA ACAAGGAGAT GAAAGCGCGC AGTTCTCGCT TGGAAGCTGG
TATGCGGAGG GGCGATACGT TAAGCCTGAT TATAAACTAG CCATAAAATG GCTGGAGAAA
GCGGGTAAAC AAGGTTCTGA TTTTTCCTAT TTCATCCTTG GCTATCATTA CAACTATGGT
GAAAATTTTC CACTTAGTCG ACAAAAAGCG CTGGAGTGGT ATCGCAAGGC GGCAGAGCTA
GGGGATAGTA GTACGCAAGA AATCCTCGGT GACGCCTATA TGTATGGCGA TGGGTTTCCC
CAAAATACCC AGCTAGCGCT GGAGTGGTAT CGAAAAGCCG CCTCTCCAAC CAATGATGCG
GGCGTTGTCC GCGGACAGGG TTCAGCTTCG TCAGCACAAT TTAAGCTTGG AGTAATGTAC
GCCCACGGTC AGGGCGTTCC TCAGGATTAT CAGCAAACGG CGATCTTGAT GCGTAAAGCG
GCGGAAAATA TGTACTACCC CGCGCAACTT TATCTCGGCG TTGCCTATTT TTATGGTGAA
GGCGTGCCTC AGGATTATCG TCAGGCAGTT TACTGGCTTA ATGAAGGTAT ACCAGGCAGC
TATACGCCAG GCCACATTCC GCTGAATGCG CTCTATGATA AAGCGCATCC CGCTGACCGG
GTTCACTCTC AGACGTGGTA TCGAAAAACA GCGCAACGCG TGATGGCAAA GGTACAGTAC
AATTTTGGCG TATGGTATTA CAACGGTTAT CACTTATTGA AAGATCACAA CCTGGCGCTG
GAGTGGTATC GCAGAGCCGC AGCGCAAGGA TTGGCCGAGG CGCAGGATGC GATCGGCGTG
ATGTTTATGC AGGGTGAGGG CGTTTCTCAG GACTACCAAC AGGCGCTGGC GTGGTATCGC
AAAGCCGCTC GCCAGGGGCT GCCTGCTGCG CAAACGCATC TGGGCATCAT GTCTGCATTT
GGTCGCGGCG TCGCGCAAAG CGACAGACAG GCTATCGCAT GGTATCGAAA AGCAGCAAAA
CAGGATTTTG CGAAAGCGCA ATATCAGCTT GGCGTAGCAT ATAGCACGGG AAGAGGCGTG
CCTGAAAATA GCCGGAATGC GCTGAAGTGG TACCTCAAAG CGGCAGAGCA GGGGTTTACT
CCGGCACAAT CAGCGCTTGG GGAAATTTAT GCTCATGGGC GCCAGGGTGT GCCTAAAGAT
AATAAGCAGG CCTATATTTG GTATTACATG GCAAGTATGT ATACGGAAAA ATCTAAGGAT
GATTGTTCAG CGCTCATTGC CGAAAGAAAC CGACTCAAGG GAACGCTCAC CCCAGATCAA
CTTAGCGAGA CATATGCCGC CTTTAATCTC ATCTGGCGAA AAATTGATCA GTCAAAGGAG
GCGAAAAAGA TTGCCAGGAA GAAGTATTGA
 
Protein sequence
MRFSSIFKII ILNIFISKMA HAAVCEERYP ADSEECQYVQ ELEQKAEQGD ESAQFSLGSW 
YAEGRYVKPD YKLAIKWLEK AGKQGSDFSY FILGYHYNYG ENFPLSRQKA LEWYRKAAEL
GDSSTQEILG DAYMYGDGFP QNTQLALEWY RKAASPTNDA GVVRGQGSAS SAQFKLGVMY
AHGQGVPQDY QQTAILMRKA AENMYYPAQL YLGVAYFYGE GVPQDYRQAV YWLNEGIPGS
YTPGHIPLNA LYDKAHPADR VHSQTWYRKT AQRVMAKVQY NFGVWYYNGY HLLKDHNLAL
EWYRRAAAQG LAEAQDAIGV MFMQGEGVSQ DYQQALAWYR KAARQGLPAA QTHLGIMSAF
GRGVAQSDRQ AIAWYRKAAK QDFAKAQYQL GVAYSTGRGV PENSRNALKW YLKAAEQGFT
PAQSALGEIY AHGRQGVPKD NKQAYIWYYM ASMYTEKSKD DCSALIAERN RLKGTLTPDQ
LSETYAAFNL IWRKIDQSKE AKKIARKKY
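
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal Python sketch, not part of the original record, that strips the block formatting, computes a GC fraction, and translates codons. The helper names are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 is assumed to share (the two tables differ in permitted start codons). The sketch also assumes the input contains only A, C, G and T.

```python
# Codon table in NCBI ordering (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = clean_sequence("ATGCGTTTTT CATCAATTTT CAAAATAATA")
print(translate(first_bases))  # MRFSSIFKII
```

The translated prefix matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would allow a comparison with the GC content field.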