Gene SNSL254_A3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3090 
Symbol 
ID6483176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3005618 
End bp3007399 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content52% 
IMG OID642738402 
Productcell invasion protein SipB 
Protein accessionYP_002042126 
Protein GI194444977 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0000000315688 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTAAATG ACGCAAGTAG CATTAGCCGT AGCGGATATA CCCAAAATCC GCGCCTCGCT 
GAGGCGGCTT TTGAAGGCGT TCGTAAGAAC ACGGACTTTT TAAAAGCGGC GGATAAAGCT
TTTAAAGATG TGGTGGCAAC GAAAGCGGGC GACCTTAAAG CCGGAACAAA GTCCGGCGAG
AGCGCTATTA ATACGGTGGG TCTAAAGCCG CCTACGGACG CCGCCCGGGA AAAACTCTCC
AGCGAAGGGC AATTGACATT ACTGCTTGGC AAGTTAATGA CCCTACTGGG CGATGTTTCG
CTGTCTCAAC TGGAGTCTCG TCTGGCGGTA TGGCAGGCGA TGATTGAGTC ACAAAAAGAG
ATGGGGATTC AGGTATCGAA AGAATTCCAG ACGGCTCTGG GAGAGGCTCA GGAGGCGACG
GATCTCTATG AAGCCAGCAT CAAAAAGACG GATACCGCCA AGAGTGTTTA TGACGCTGCG
GCCAAAAAAC TGACGCAGGC ACAAAATAAA TTGCAATCGC TGGACCCGGC TGACCCCGGC
TATGCACAAG CTGAAGCCGC GGTAGAACAG GCCGGAAAAG AAGCGACAGA GGCGAAAGAG
GCCTTAGATA AGGCCACGGA TGCGACGGTT AAAGCAGGCA CAGACGCCAA AGCGAAAGCC
GAGAAAGCGG ATAACATTCT GACCAAATTC CAGGGAACGG CTAATGCCGC CTCTCAGAAT
CAGGTTTCCC AGGGTGAGCA GGATAATCTG TCAAATGTCG CCCGCCTCAC TATGCTCATG
GCCATGTTTA TTGAGATTGT GGGCAAAAAT ACGGAAGAAA GCCTGCAAAA CGATCTTGCG
CTTTTCAACG CCTTGCAGGA AGGGCGTCAG GCGGAGATGG AAAAGAAATC GGCTGAATTC
CAGGAAGAGA CGCGCAAAGC CGAGGAAACG AACCGCATTA TGGGATGTAT CGGGAAAGTC
CTCGGCGCGC TGCTAACCAT TGTCAGCGTT GTGGCCGCTG TTTTTACCGG TGGGGCGAGT
CTGGCGCTGG CTGCGGTGGG ACTTGCGGTA ATGGTTGCCG ATGAAATTGT GAAGGCGGCG
ACGGGGGTGT CGTTTATTCA GCAGGCGCTA AACCCGATTA TGGAGCATGT GCTGAAGCCG
TTAATGGAGC TGATTGGCAA GGCGATTACC AAAGCGCTGG AAGGATTAGG CGTCGATAAG
AAAACGGCAG AGATGGCAGG CAGCATTGTT GGTGCGATTG TTGCCGCTAT TGCGATGGTA
GCGGTCATTG TGGTGGTCGC AGTTGTCGGG AAAGGCGCGG CGGCGAAACT GGGTAACGCG
CTGAGCAAAA TGATGGGCGA AACGATTAAG AAGTTGGTGC CTAACGTGCT GAAACAGTTA
GCACAAAACG GCAGCAAACT CTTTACCCAG GGGATGCAAC GTATTACTAG CGGCCTGGGT
AATGTGGGTA GCAAGATGGG CCTGCAAACG AATGCCTTAA GTAAAGAGCT GGTAGGTAAT
ACCCTAAATA AAGTGGCGTT GGGCATGGAA GTCACGAATA CCGCAGCCCA GTCAGCCGGT
GGTGTTGCCG AGGGGGTATT TATTAAAAAT GCCAGCGAGG CGCTTGCTGA TTTTATGCTC
GCCCGTTTTG CCATGGATCA AATTCAGCAG TGGCTTAAAC AATCCGTAGA AATATTTGGT
GAAAACCAGA AGGTAACGGC GGAACTGCAA AAAGCCATGT CTTCTGCGGT ACAGCAAAAT
GCGGATGCTT CGCGTTTTAT TCTGCGCCAG AGTCGCGCAT AA
 
Protein sequence
MVNDASSISR SGYTQNPRLA EAAFEGVRKN TDFLKAADKA FKDVVATKAG DLKAGTKSGE 
SAINTVGLKP PTDAAREKLS SEGQLTLLLG KLMTLLGDVS LSQLESRLAV WQAMIESQKE
MGIQVSKEFQ TALGEAQEAT DLYEASIKKT DTAKSVYDAA AKKLTQAQNK LQSLDPADPG
YAQAEAAVEQ AGKEATEAKE ALDKATDATV KAGTDAKAKA EKADNILTKF QGTANAASQN
QVSQGEQDNL SNVARLTMLM AMFIEIVGKN TEESLQNDLA LFNALQEGRQ AEMEKKSAEF
QEETRKAEET NRIMGCIGKV LGALLTIVSV VAAVFTGGAS LALAAVGLAV MVADEIVKAA
TGVSFIQQAL NPIMEHVLKP LMELIGKAIT KALEGLGVDK KTAEMAGSIV GAIVAAIAMV
AVIVVVAVVG KGAAAKLGNA LSKMMGETIK KLVPNVLKQL AQNGSKLFTQ GMQRITSGLG
NVGSKMGLQT NALSKELVGN TLNKVALGME VTNTAAQSAG GVAEGVFIKN ASEALADFML
ARFAMDQIQQ WLKQSVEIFG ENQKVTAELQ KAMSSAVQQN ADASRFILRQ SRA