Gene SNSL254_A2282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2282 
Symbol 
ID6486828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2192882 
End bp2194102 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content56% 
IMG OID642737628 
Productputative colanic acid biosynthesis glycosyltransferase WcaL 
Protein accessionYP_002041370 
Protein GI194442639 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00000100013 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGTCA GCTTTTTTCT GCTGAAATTT CCACTCTCAT CGGAAACCTT TGTGCTGAAT 
CAGATTACTG CGTTTATTGA TATGGGCCAT GAGGTGGAGA TTGTCGCGTT ACAAAAAGGC
GATACCCAAC ATACTCACGC CGCCTGGGAG AAGTATGGCC TGGCGGCGAA AACCCGCTGG
TTACAGGATG AGCCCCAGGG ACGGCTGGCG AAACTGCGCT ACCGGGCATG TAAAACGCTG
CCGGGGCTGC ATCGGGCGGC GACCTGGAAA GCGCTCAATT TTACCCGCTA TGGCGATGAA
TCACGCAATT TGATTCTTTC CGCGATTTGC GCGCAGGTGA GCCACCCTTT TGTGGCGGAT
GTGTTCATCG CGCACTTTGG TCCGGCGGGC GTGACGGCGG CTAAACTACG CGAACTGGGC
GTGCTTCGCG GCAAAATCGC GACTATTTTC CACGGGATTG ATATTTCCAG CCGCGAAGTG
CTCAGTCATT ACACGCCGGA GTATCAGCAG TTGTTTCGTC GTGGCGATCT GATGCTGCCC
ATCAGCGATC TGTGGGCCGG TCGCCTGAAA AGTATGGGCT GTCCGCCGGA AAAGATTGCC
GTTTCGCGCA TGGGCGTCGA CATGACGCGT TTTACCCATC GTCCGGTGAA AGCGCCAGGG
ATGCCGCTGG AGATGATTTC CGTCGCGCGC CTGACCGAGA AAAAAGGCCT GCATGTGGCG
ATTGAAGCCT GTCGGCAACT GAAAGCGCAG GGCGTGGCGT TTCGCTACCG CATTCTGGGG
ATTGGCCCGT GGGAACGTCG GCTGCGCACG CTCATCGAGC AGTATCAGCT AGAGGATGTC
ATTGAGATGC TGGGGTTTAA ACCGAGCCAT GAAGTGAAGG CGATGCTGGA TGACGCCGAT
GTTTTTTTGC TGCCGTCGAT TACCGGTACG GATGGCGATA TGGAAGGTAT TCCGGTAGCG
CTGATGGAGG CGATGGCGGT AGGGATTCCC GTGGTATCTA CCGTGCATAG CGGTATTCCG
GAACTGGTGG AGGCCGGCAA ATCCGGCTGG CTGGTGCCGG AAAACGATGC GCAGGCGCTG
GCGGCCCGAC TCGCTGAGTT CAGCCGGATT GACCACGACA CGCTGGAGTC GGTGATCACG
CGCGCCCGTG AAAAAGTGGC GCAAGATTTT AATCAGCAGG TGATTAATCG CCAGTTAGCC
TGCCTGCTAC AAACGATATA A
 
Protein sequence
MKVSFFLLKF PLSSETFVLN QITAFIDMGH EVEIVALQKG DTQHTHAAWE KYGLAAKTRW 
LQDEPQGRLA KLRYRACKTL PGLHRAATWK ALNFTRYGDE SRNLILSAIC AQVSHPFVAD
VFIAHFGPAG VTAAKLRELG VLRGKIATIF HGIDISSREV LSHYTPEYQQ LFRRGDLMLP
ISDLWAGRLK SMGCPPEKIA VSRMGVDMTR FTHRPVKAPG MPLEMISVAR LTEKKGLHVA
IEACRQLKAQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMLGFKPSH EVKAMLDDAD
VFLLPSITGT DGDMEGIPVA LMEAMAVGIP VVSTVHSGIP ELVEAGKSGW LVPENDAQAL
AARLAEFSRI DHDTLESVIT RAREKVAQDF NQQVINRQLA CLLQTI