Gene SeSA_A2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2333 
Symbol 
ID6515316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2202985 
End bp2204205 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content57% 
IMG OID642747396 
Productputative colanic acid biosynthesis glycosyltransferase WcaL 
Protein accessionYP_002115189 
Protein GI194735345 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.68197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCA GCTTTTTTCT GCTGAAATTT CCACTCTCAT CGGAAACCTT TGTGCTGAAT 
CAGATTACTG CGTTTGTTGA TATGGGCCAT GAGGTGGAGA TTGTCGCGTT ACAAAAAGGC
GATACCCAAC ATACTCACGC CGCCTGGGAG AAGTATGGCC TGGCGGCGAA AACCCGCTGG
TTACAGGATG AGCCCCAGGG ACGGCTGGCG AAACTGCGCT ACCGGGCATG TAAAACGCTG
CCGGGGCTGC ATCGTGCGGT GACCTGGAAA GCGCTCAATT TTATCCGCTA TGGCGATGAA
TCACGCAATT TGATCCTTTC CGCGATTTGC GCGCAGGTGA GCCACCCTTT TGTGGCGGAT
GTGTTTATCG CGCACTTTGG TCCGGCGGGC GTGACGGCGG CCAAACTACG CGAACTGGGC
GTGCTTCGCG GCAAAATCGC GACTATTTTC CACGGGATTG ATATTTCCAG CCGCGAAGTG
CTCAGTCATT ACACGCCGGA GTATCAGCAG TTGTTTCGTC GTGGCGATCT GATGCTGCCC
ATCAGCGATC TGTGGGCCGG TCGCCTGAAA AGTATGGGCT GCCCGCCGGA AAAGATTGCC
GTTTCGCGCA TGGGCGTCGA CATGACGCGT TTTTCCCATC GTCCGGTGAA AGCGCCAGGG
ATGCCGCTGG AGATGATTTC CGTCGCGCGC CTGACCGAGA AAAAAGGACT GCATGTGGCG
ATTGAAGCCT GTCGGCAACT GAAAGCGCAG GGCGTGGCGT TTCGCTACCG CATTCTGGGG
ATTGGCCCGT GGGAACGTCG GCTGCGCACG CTCATCGAGC AGTATCAGTT AGAGGATGTC
ATTGAGATGC CGGGGTTTAA ACCGAGCCAT GAAGTGAAGG CGATGCTGGA TGACGCCGAT
GTTTTTTTGC TGCCGTCGAT TACCGGTACG GATGGCGATA TGGAAGGTAT TCCGGTAGCG
CTGATGGAGG CGATGGCGGT GGGGATTCCC GTGGTGTCTA CCGTGCATAG CGGCATTCCG
GAACTGGTGG AGGCCGGCAA ATCCGGCTGG CTGGTGCCGG AAAACGATGC GCAGGCGCTG
GCGGCCCGAC TCGCTGAGTT CAGCCGGATT GACCACGACA CGCTGGAGTC GGTGATCACG
CGCGCCCGTG AAAAAGTGGC GCAAGATTTT AACCAGCAGG TGATTAATCG CCAGTTAGCC
AGCCTGCTAC AAACGATATA A
 
Protein sequence
MKVSFFLLKF PLSSETFVLN QITAFVDMGH EVEIVALQKG DTQHTHAAWE KYGLAAKTRW 
LQDEPQGRLA KLRYRACKTL PGLHRAVTWK ALNFIRYGDE SRNLILSAIC AQVSHPFVAD
VFIAHFGPAG VTAAKLRELG VLRGKIATIF HGIDISSREV LSHYTPEYQQ LFRRGDLMLP
ISDLWAGRLK SMGCPPEKIA VSRMGVDMTR FSHRPVKAPG MPLEMISVAR LTEKKGLHVA
IEACRQLKAQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMPGFKPSH EVKAMLDDAD
VFLLPSITGT DGDMEGIPVA LMEAMAVGIP VVSTVHSGIP ELVEAGKSGW LVPENDAQAL
AARLAEFSRI DHDTLESVIT RAREKVAQDF NQQVINRQLA SLLQTI