Gene SNSL254_A1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1890 
Symbol 
ID6486905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1849905 
End bp1851434 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content49% 
IMG OID642737259 
Producttetratricopeptide repeat protein 
Protein accessionYP_002041009 
Protein GI194445007 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.0490487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTT CATCAATTTT CAAAATAATA ATATTAAATA TATTTATTTC AAAAATGGCT 
CACGCTGCTG TCTGCGAGGA GCGCTATCCC GCCGACTCAG AAGAATGCCA GTACGTTCAG
GAATTAGAGC AAAAAGCGGA ACAAGGAGAT GAAAGCGCGC AGTTCTCGCT TGGAAGCTGG
TATGCGGAGG GGCGATACGT TAAGCCTGAT TATAAACTAG CCATAAAATG GCTGGAGAAA
GCGGGTAAAC AAGGTTCGGA TTTTTCCTAT TTCATCCTTG GCTATCATTA CAACTATGGT
GAAAATTTTC CACTTAGTCG ACAAAAAGCG CTGGAGTGGT ATCGCAAGGC GGCAGAGCTA
GGGGATAGTA GTACGCAAGA AATTCTCGGT GACGCCTATA TGTATGGCGA TGGGTTTCCC
CAAAATACCC AGCTAGCGCT GGAGTGGTAT CGAAAAGCCG CCTCTCCAAC CAATGATGCG
GGCGTTGTCC GCGGACAGGG TTCAGCTTCG TCAGCACAAT TTAAGCTTGG AGTAATGTAC
GCCCACGGTC AGGGCGTTCC TCAGGATTAT CAGCAAACGG CGATCTTGAT GCGTAAAGCG
GCGGAAAATA TGTACTACCC CGCGCAACTT TATCTCGGTG TTGCCTATTT TTATGGTGAA
GGCGTGCCTC AGGACTATCG TCAGGCAGTT TACTGGCTTA ATGAAGGTAT ACCAGGCAGC
TATACGCCAG GCCACATTCC GCTGAGTGCG CTCTATGATA AAGCGCATCC CGCTGACCGG
GTTCACTCTC AGACGTGGTA TCGAAAAACA GCGCAACGCG TGATGGCAAA GGTACAGTAC
AATTTTGGCG TATGGTATTA CAACGGTTAT CACTTATTGA AAGATCACAA CCTGGCGCTG
GAGTGGTATC GCAGAGCCGC AGCGCAAGGA TTGGCCGAGG CGCAGGATGC GATCGGCGTG
ATGTTTATGC AGGGTGAGGG CGTTTCTCAG GACTACCAAC AGGCGCTGGC GTGGTATCGC
AAAGCCGCTC GCCAGGGGCT GCCTGCTGCG CAAACGCATC TGGGCATCAT GTCTGCATTT
GGTCGCGGCG TCGCGCAAAG CGACAGACAG GCTATCGCAT GGTATCGAAA AGCAGCAAAA
CAGGATTTTG CGAAAGCGCA ATATCAGCTT GGCGTGGCAT ATAGCACGGG AAGAGGCGTG
CCTGAAAATA GCCGGAATGC GCTGAAGTGG TACCTTAAAG CGGCAGAGCA GGGGTTTACT
CCGGCACAAT TAGCGCTTGG GGAAATTTAT GCTCATGGGC GCCAGGGTGT GCCTAAAGAT
AATAAGCAGG CCTATATTTG GTATTACATG GCAAGTATAT ATACGGAAAA ATCTAAGGAT
GATTGTTCAG CGCTCATTGC CGAAAGAAAC CGACTCAAGG GAACGCTTAC CCCAGATCAA
CTTAGCGAGA CATATGCCGC CTTTAATCTC ATCTGGCGAA AAATTGACCA GTCAAAGGAG
GCGAAAAAGA CAGCCAGGAA GAAGTATTGA
 
Protein sequence
MRFSSIFKII ILNIFISKMA HAAVCEERYP ADSEECQYVQ ELEQKAEQGD ESAQFSLGSW 
YAEGRYVKPD YKLAIKWLEK AGKQGSDFSY FILGYHYNYG ENFPLSRQKA LEWYRKAAEL
GDSSTQEILG DAYMYGDGFP QNTQLALEWY RKAASPTNDA GVVRGQGSAS SAQFKLGVMY
AHGQGVPQDY QQTAILMRKA AENMYYPAQL YLGVAYFYGE GVPQDYRQAV YWLNEGIPGS
YTPGHIPLSA LYDKAHPADR VHSQTWYRKT AQRVMAKVQY NFGVWYYNGY HLLKDHNLAL
EWYRRAAAQG LAEAQDAIGV MFMQGEGVSQ DYQQALAWYR KAARQGLPAA QTHLGIMSAF
GRGVAQSDRQ AIAWYRKAAK QDFAKAQYQL GVAYSTGRGV PENSRNALKW YLKAAEQGFT
PAQLALGEIY AHGRQGVPKD NKQAYIWYYM ASIYTEKSKD DCSALIAERN RLKGTLTPDQ
LSETYAAFNL IWRKIDQSKE AKKTARKKY