Gene SeHA_C0540 details


Gene Information

Locus tag: SeHA_C0540
Symbol:
ID: 6490911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 538902
End bp: 540395
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 40%
IMG OID: 642740806
Product: tetratricopeptide repeat protein
Protein accession: YP_002044473
Protein GI: 194450922
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
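
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed but are not stated in the record. The function names are hypothetical.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 538902
END_BP = 540395


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues in a CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1494 497
```

Both results agree with the Gene Length and Protein Length fields.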


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.653123
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAAAAC GGAAATGGAG TATTTTGTTT CTGCTTGGGC TTTCTTTCGC ATCGGTATGT 
CAGGCAGGTG ATGATATTGA CGCGTTGTAT AGCCGTATTG CGCAGAAAGA TATTCAGGCG
CTAAATGAGC TAAAAGCGTT AGCCAAAGAT AATAATGCTC AAGCGCTGGC GGAGTTGGGG
TTTATTTATG AGTACGGTGT TTCAGTATCT GTGGATATTC CGCAAGCAAT AAAGTACTAC
GAGCAGGCTT GTGATTTGGA CGGTGATTAT GGTTGCTTTA ATGCAGCTTA TTTCTATGAA
TATGGCATCG GTACCCAAAA GGACATTACG CAAGCGAAAA CATTAGCAAA ACAGCTTAGA
GAAAAAATTA ATCTATCCAA TATCAATCTC GATAAAAAGG TGAGTGAACG TATAATTGGG
AGTGTTTATA GTAATAAACT TGAGGCATAT AGAGACCTTT CATTTCGTCC TCATTTTATT
CAAGGCTTAA GTTTCTATTT TTATGCCCCG CAATCAGATG AGCGAAAACT GCTGAGTCGA
ATAGGTTTTG ACTCATCACA TCTTGCGCGG ATAGCGATAC TCTGGGCGAG AGAGGGTGAT
CCTGAGGTCG CGTATCAGAC GGCAAAACTG GTCTCTACTT TGTATTTTAA TAATGAAACG
AAAACGATAG ATATTGCTGA AGCGCTGAAA TGGCTCCGAA TTTCGGCCGA GAAGGGGGAT
GCCGACTCAC AGACGCTGTT AGGTTTTCTT TATGAACACG CTGGATTGGG ATTACAACCA
GATGGAGAGA AAGCCAGGAA ATGGTATGAA ATGGCGGCGC AGCAAGGTAA TGGAGAAGCA
TTGTATACAT TAGGTCGTAT GTACTATTCT GGCGTAATGG TTAACGTTGA TTACGATAAA
GCGCTGTATT TTTTTAAAAA AGCTTATGAG AAAGAATTAC AGGCGGCCGC TGATTATTTA
GCACAGATGT ATTTTAATGG CCAGAGTGTA GATGTCGATT GCCAACAATC CTGGCACTAT
TACGATAACA GCTATATAAA AAAGATGACA CAACGCGATT ATCTTGATTA TTGTGAAAAA
GATCGAAAGC GCCGTAATGA TTTTAGTCAG CAACTTCCTG AGCTGACATT AGAAAAATAT
GCGGGATTAT TTGGCAGGAT TGATAATATT CCTCTGTGTC AGATTGGATT TGTTGTCAAT
ACGAATAAGT TAATTCATGT TGCGAACTTA CGTGTGGAAT TGATATTAAA AAATGATGCT
GGAGTTAGTG ACGAAAGAAT GGTCGCTTTT CCTCCTTTGG GTCTCAATAC TCTGGGTGCT
GAACAGGGTA TGGGGGATTC TTTTAAGTCG ATGGGATATC TCTTGATGAA AAACGGTGAC
TTGTGTGATT ACCATAAACT TACTTTCACC GTGAAGTCAG CGACGGCAAC AATCAATGGT
AAAAAAGTTG ATTTACTGAA AACGGATAAT TTACATATTA TTCAGAATCG ATAA
 
Protein sequence
MIKRKWSILF LLGLSFASVC QAGDDIDALY SRIAQKDIQA LNELKALAKD NNAQALAELG 
FIYEYGVSVS VDIPQAIKYY EQACDLDGDY GCFNAAYFYE YGIGTQKDIT QAKTLAKQLR
EKINLSNINL DKKVSERIIG SVYSNKLEAY RDLSFRPHFI QGLSFYFYAP QSDERKLLSR
IGFDSSHLAR IAILWAREGD PEVAYQTAKL VSTLYFNNET KTIDIAEALK WLRISAEKGD
ADSQTLLGFL YEHAGLGLQP DGEKARKWYE MAAQQGNGEA LYTLGRMYYS GVMVNVDYDK
ALYFFKKAYE KELQAAADYL AQMYFNGQSV DVDCQQSWHY YDNSYIKKMT QRDYLDYCEK
DRKRRNDFSQ QLPELTLEKY AGLFGRIDNI PLCQIGFVVN TNKLIHVANL RVELILKNDA
GVSDERMVAF PPLGLNTLGA EQGMGDSFKS MGYLLMKNGD LCDYHKLTFT VKSATATING
KKVDLLKTDN LHIIQNR
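
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library, not a definitive implementation. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the helper names `clean`, `gc_content` and `translate` are hypothetical. The demo input is the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, in the record's block layout.
prefix = "ATGATAAAAC GGAAATGGAG TATTTTGTTT"
print(translate(prefix))  # MIKRKWSILF
```

Passing the full gene sequence to `translate` and `gc_content` gives values to compare with the protein sequence and the GC content field.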