Gene SeD_A0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0479 
Symbol 
ID6873559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp495091 
End bp496584 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content40% 
IMG OID642783705 
Producttetratricopeptide repeat protein 
Protein accessionYP_002214392 
Protein GI198242362 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.786454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAC GGAAATGGAG TATTTTGTTT CTGCTTGGGC TTTCTTTCGC ATCGGTATGT 
CAGGCAGGTG GTGATATTGA CGCGTTGTAT AGCCGTATTG CGCAGAAAGA TATTCAGGCG
CTAAATGAGC TAAAAGCGTT AGCCAAAGAT AATAATGCTC AAGCGCTGGC GGAGTTGGGG
TTTATTTATG AGTACGGTGT TTCAGTATCT GTGGATATTC CGCAAGCAAT AAAGTACTAC
GAGCAGGCTT GTGATTTGGA CGGTGATTAT GGTTGCTTTA ATGCAGCTTA TTTCTATGAA
TATGGCATCG GTACCCAAAA GGACATTACG CAAGCGAAAA CATTAGCAAA ACAGCTTAGA
GAAAAAATTA ATCTACCCAA TATCAATCTC GATAAAAAGG TGAGTGAACG TATAATTGGG
AGTGTTTATA GTAATAAACT TGAGGCATAT AGAGACCTTT CATTTCGTCC TCATTTTATT
CAAGGCTTAA GTTTCTATTT TTATGCCCCG CAATCAGATG AGCGAAAACT GCTGAGTCGA
ATAGGTTTTG ACTCATCACA TCTTGCGCGG ATAGCGATAC TCTGGGCGAG AGAGGGTGAT
CCTGAGGTCG CGTATCAGAC GGCAAAACTG GTCTCTACTT TGTATTTTAA TAATGAAACG
AAAACGATAG ATATTGCTGA AGCGCTGAAA TGGCTCCGAA TTTCGGCCGA GAAGGGGGAT
GCCGACTCAC AGACGCTGTT AGGTTTTCTT TATGAACACG CTGGATTGGG ATTACAACCA
GATGGAGAGA AAGCCAGGAA ATGGTATGAA ATGGCGGCGC AGCAAGGTAA TGGAGAAGCA
TTGTATACGT TAGGTCGTAT GTACTATTCT GGCGTAATGG TTAACGTTGA TTACGATAAA
GCGCTGTATT TTTTTAAAAA AGCTTATGAG AAAGAATTAC AGGCGGCCGC TGATTATTTA
GCACAGATGT ATTTTAATGG CCAGAGTGTA GATGTCGATT GCCAACAATC CTGGCACTAT
TACGATAACA GCTATATAAA AAAGATGACA CAACGCGATT ATCTTGATTA TTGTGAAAAA
GATCGAAAGC GCCGTAATGA TTTTAGTCAG CAACTTCCTG AGCTGACATT AGAAAAATAT
GCGGGATTAT TTGGCAGGAT TGATAATATT CCTCTGTGTC AGATTGGATT TGTTGTCAAT
ACGAATAAGT TAATTCATGT TGCGAACTTA CGTGTGGAAT TGATATTAAA AAATGATGCT
GGAGTTAGTG ACGAAAGAAT AGTCGCTTTT CCTCCTTTGG GTCTCAATAC TCTGGGTGCT
GAACAGGGTA TGGGGGATTC TTTTAAGTCG ATGGGATATC TCTTGATGAA AAACGGTGAC
TTGTGTGATT ACCATAAACT TACTTTCACC GTGAAGTCAG CGACGGCAAC AATCAATGGT
AAAAAAGTTG ATTTACTGAA AACGGATAAT TTACATATTA TTCAGAATCG ATAA
 
Protein sequence
MIKRKWSILF LLGLSFASVC QAGGDIDALY SRIAQKDIQA LNELKALAKD NNAQALAELG 
FIYEYGVSVS VDIPQAIKYY EQACDLDGDY GCFNAAYFYE YGIGTQKDIT QAKTLAKQLR
EKINLPNINL DKKVSERIIG SVYSNKLEAY RDLSFRPHFI QGLSFYFYAP QSDERKLLSR
IGFDSSHLAR IAILWAREGD PEVAYQTAKL VSTLYFNNET KTIDIAEALK WLRISAEKGD
ADSQTLLGFL YEHAGLGLQP DGEKARKWYE MAAQQGNGEA LYTLGRMYYS GVMVNVDYDK
ALYFFKKAYE KELQAAADYL AQMYFNGQSV DVDCQQSWHY YDNSYIKKMT QRDYLDYCEK
DRKRRNDFSQ QLPELTLEKY AGLFGRIDNI PLCQIGFVVN TNKLIHVANL RVELILKNDA
GVSDERIVAF PPLGLNTLGA EQGMGDSFKS MGYLLMKNGD LCDYHKLTFT VKSATATING
KKVDLLKTDN LHIIQNR