Gene SNSL254_A2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2266 
Symbol 
ID6483680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2176176 
End bp2177375 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content34% 
IMG OID642737613 
ProductO-antigen polymerase 
Protein accessionYP_002041355 
Protein GI194445247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0000330742 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCCAT TTCCACCAGG AGCAATCCTA AGGGATGTAC TCAATGTATT TTTTGTGGCG 
TTAGTGCTAG TTCGATTTGT CATTGATAGG AAAAAAACTT ATTTCCCGTT GGTTTTTACT
ATTTTTTCAT GGTCGGCGGT AATACTATGG GTAATAGCGT TAACTATATT CTCACCGGAT
AAAATTCAAG CAATTATGGG GGGGCGGAGT TATATTTTAT TCCCGGCAGT TTTCATAGCA
TTAGTGATTT TAAAAGTATC ATACCCGCAA TCCTTAAATA TTGAAAAAAT AGTTTGCTAC
ATAATTTTTC TAATGTTTAT GGTTGCGACA ATATCTATTA TTGATGTACT AATGAATGGA
GAGTTCATTA AATTGCTCGG ATATGATGAG CATTATGCAG GAGAACAATT AAACTTAATT
AATAGCTATG ATGGGATGGT CCGGGCTACA GGCGGTTTTA GTGATGCTCT CAATTTTGGA
TATATGCTCA CATTAGGTGT TTTGTTATGT ATGGAGTGTT TTTCCCAAGG ATATAAAAGA
TTATTGATGC TCATTATTAG TTTTGTGCTA TTTATAGCGA TCTGCATGAG TCTTACTAGA
GGAGCAATAC TTGTTGCTGC GCTTATTTAC GCACTTTATA TAATTTCAAA TCGGAAGATG
CTTTTTTGTG GAATAACTTT ATTTGTAATA ATTATACCCG TTTTAGCAAT TTCTACTAAT
ATTTTTGACA ACTATACAGA AATTTTGATC GGCAGGTTTA CAGATTCGTC TCAGGCATCG
CGTGGATCTA CACAGGGGCG GATAGATATG GCAATTAATT CATTAAACTT CCTGTCAGAA
CATCCATCAG GTATAGGTCT GGGTACTCAA GGTTCAGGAA ACATGCTTTC GGTAAAAGAT
AATAGGTTAA ATACGGATAA TTATTTTTTC TGGATCGCCC TTGAGACTGG TATTATTGGC
TTAATCATAA ATATTATTTA TCTGGCAAGT CAATTTTATT CTTCAACTTT ACTAAATAGA
ATATATGGCA GTCATTGTAG CAATATGCAC TATAGATTAT ATTTTCTCTT TGGAAGTATA
TATTTTATAA GTGCAGCGTT AAGTTCAGCA CCTTCGTCAT CAACTTTTTC TATATATTAT
TGGACAGTTT TAGCTTTGAT TCCATTTTTA AAATTAACAA ATAGACGGTG CACGCGATAA
 
Protein sequence
MLPFPPGAIL RDVLNVFFVA LVLVRFVIDR KKTYFPLVFT IFSWSAVILW VIALTIFSPD 
KIQAIMGGRS YILFPAVFIA LVILKVSYPQ SLNIEKIVCY IIFLMFMVAT ISIIDVLMNG
EFIKLLGYDE HYAGEQLNLI NSYDGMVRAT GGFSDALNFG YMLTLGVLLC MECFSQGYKR
LLMLIISFVL FIAICMSLTR GAILVAALIY ALYIISNRKM LFCGITLFVI IIPVLAISTN
IFDNYTEILI GRFTDSSQAS RGSTQGRIDM AINSLNFLSE HPSGIGLGTQ GSGNMLSVKD
NRLNTDNYFF WIALETGIIG LIINIIYLAS QFYSSTLLNR IYGSHCSNMH YRLYFLFGSI
YFISAALSSA PSSSTFSIYY WTVLALIPFL KLTNRRCTR