Gene SeAg_B2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2098 
Symbol 
ID6794235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2020531 
End bp2022492 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content41% 
IMG OID642776318 
Producttetratricopeptide repeat (TPR)-containing protein 
Protein accessionYP_002146946 
Protein GI197248589 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.903394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTATC TCGACGATTT TCCCAAGCGC GATCAAAATC ATGTCAACGA TACAATGGCC 
AAGACTGCGT TTGAGGCTTT TATCGCTTCG TCGGACGTGG TCCTCAAACA AGGTTCAGAT
GATAATGATT ACGGCAGCGA TTACCAGCTT GAAATAGTAC ACGATGGGAT GGCTACTAAT
GTTCGGCTTC AAGTGCAGTT AAAAGGAACT GCTGCAGATT TGAATGCTGA TGGGTCTGTC
AGTATTTCAG TTAAACGTTC AAATCTGAAT TACCTTCTGA TGAGTCCTGG TTCCCTTTAT
GTCTGTTTCC ATATTCCTAC AAACACGCTG AAGGTTACCT CTGCTCAGAG TGTTCTGGCA
CAATACCGAA ATACAGGCAA AGATTGGCAA TCTCAGAAAT CTGTTACCGT CAATTTCACT
GAAACTTTGA CCGATCAGCG CCTTATCAGG GTTGTCTCGC TAATTCGGCT TAGCTCTCTT
GATGCGAGAA ATCGCCGTGT CGCTCATAGT AATATCGATG ATAATAACAT GGTAGACTAT
GCTAGGGCTT CGCAAACTAT TTATGAAGTT TCTGAAGATA TTGATTCTGC GACTAAACAG
CTTGTGAATC TCTACCGTAG TAACCAGACT GAAATTATCA GTACTGCATA TGAACGATTC
AAAGCAATAT TAGGAGAAGA ACATCCAGCA ATGATTTATT GCTGGATGGC TGAGATTGAC
CTTGCAAGTG CCAATAAAAT TTTCGACCAT CATCGTATTG AATTGGGTAT TCTGAAAATG
AAGGCACTTT CACTTATTAA TGGTAAGGAG GATGCTGGTT TGCATTACTC CATTGGCAAC
GGTTTTGCTG CTCTGAATGA TTTTAACGGC GCGTTGAACG AATATGAGAT AGCATGTGAG
CTTAATAAAC AGAGCATTAA TGATGAGCTT ATGGCGATGA TTTACAAGAA TATGGGGGGG
AGCTACGCTG CGCTAGAAAA TGAAAAGCAG GCAGTTGAAT GCTATTTATT AGCATTAGAG
CATAATCCTC ATCTGGCCGA AGCACATTAT GCTTTAGGTC TTTATTATCA CAATACGAGC
CAGTTCGAAA TGGCCTTGGA GCATCTGGAT AAAACAATTT TTTCTAAAAA TACGCAAGGA
AACCTGATTA ATCTACAGGG CTGGCGTATA TCGACTCTCT TCAATGTTGG AGAAGGTAGG
TCTGCGTTTA GAGAGATTAA TACTCTACTG AGCCAGGCAG ATAAAGCGCA GTGGATATGG
TCATGGTGTT TAAAAATCGT CGCACAGTTT GGCCGTAAAT CAATTGAGAA TGCAAAACTC
AGCCTCCCAT TTTGGGAGTC AGTTTTGCGC CATTTCCCTA ATAACTCAGA CGTTCAACGA
GAATCGCTGC TGGCTATAAT CTACCTACAA AACCGCAATA TGAATAGTCA TAAAACTTAT
TCCCAGTTTA AAAATGATCT TGAGTCATAC TCAGATAATA TAGGGTCAGA TGCAGCGAGT
CTGCTCTGGG ACCTGTTGGG CCATTGGGCG GAAGATGAAG ATCGCGGAGA TGAAGCTATA
TTATGCTTTG AAAAAGCCTA TTCTCTTCAG AAAGGTGATT ACGGATTATG TTTTTCAATC
GCACTGAACA ACCAACAACG CTATGAAGAA AGTGAAAAAT TGATGAAATC CTATATATCA
GTTTTTCCTG ATGATGCTCA AGGTTGGTAT CAGTTGGCTA GTACCTATGA CTTGATGGGG
CAACTGGAGA AGTGCATCGC ATCCTATCGT CAAAGTTTGT CTCTAAATGT CGATAACGAC
CATGCTTGGT TTAATCTTGG GGGAGCATTT TTCAATATGG GTAATTACTC CGAAGCACGA
CAAATCTGGA AAGAAGCAGT AAACAGATAC CCAGACCATG AATTGACCGC AAAACTCAGA
GCTGATATAC CTTTCATACT TAGCGATGAG CCACTCCCAT AA
 
Protein sequence
MDYLDDFPKR DQNHVNDTMA KTAFEAFIAS SDVVLKQGSD DNDYGSDYQL EIVHDGMATN 
VRLQVQLKGT AADLNADGSV SISVKRSNLN YLLMSPGSLY VCFHIPTNTL KVTSAQSVLA
QYRNTGKDWQ SQKSVTVNFT ETLTDQRLIR VVSLIRLSSL DARNRRVAHS NIDDNNMVDY
ARASQTIYEV SEDIDSATKQ LVNLYRSNQT EIISTAYERF KAILGEEHPA MIYCWMAEID
LASANKIFDH HRIELGILKM KALSLINGKE DAGLHYSIGN GFAALNDFNG ALNEYEIACE
LNKQSINDEL MAMIYKNMGG SYAALENEKQ AVECYLLALE HNPHLAEAHY ALGLYYHNTS
QFEMALEHLD KTIFSKNTQG NLINLQGWRI STLFNVGEGR SAFREINTLL SQADKAQWIW
SWCLKIVAQF GRKSIENAKL SLPFWESVLR HFPNNSDVQR ESLLAIIYLQ NRNMNSHKTY
SQFKNDLESY SDNIGSDAAS LLWDLLGHWA EDEDRGDEAI LCFEKAYSLQ KGDYGLCFSI
ALNNQQRYEE SEKLMKSYIS VFPDDAQGWY QLASTYDLMG QLEKCIASYR QSLSLNVDND
HAWFNLGGAF FNMGNYSEAR QIWKEAVNRY PDHELTAKLR ADIPFILSDE PLP