Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2282 |
Symbol | |
ID | 6486828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2192882 |
End bp | 2194102 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642737628 |
Product | putative colanic acid biosynthesis glycosyltransferase WcaL |
Protein accession | YP_002041370 |
Protein GI | 194442639 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.00000100013 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGTCA GCTTTTTTCT GCTGAAATTT CCACTCTCAT CGGAAACCTT TGTGCTGAAT CAGATTACTG CGTTTATTGA TATGGGCCAT GAGGTGGAGA TTGTCGCGTT ACAAAAAGGC GATACCCAAC ATACTCACGC CGCCTGGGAG AAGTATGGCC TGGCGGCGAA AACCCGCTGG TTACAGGATG AGCCCCAGGG ACGGCTGGCG AAACTGCGCT ACCGGGCATG TAAAACGCTG CCGGGGCTGC ATCGGGCGGC GACCTGGAAA GCGCTCAATT TTACCCGCTA TGGCGATGAA TCACGCAATT TGATTCTTTC CGCGATTTGC GCGCAGGTGA GCCACCCTTT TGTGGCGGAT GTGTTCATCG CGCACTTTGG TCCGGCGGGC GTGACGGCGG CTAAACTACG CGAACTGGGC GTGCTTCGCG GCAAAATCGC GACTATTTTC CACGGGATTG ATATTTCCAG CCGCGAAGTG CTCAGTCATT ACACGCCGGA GTATCAGCAG TTGTTTCGTC GTGGCGATCT GATGCTGCCC ATCAGCGATC TGTGGGCCGG TCGCCTGAAA AGTATGGGCT GTCCGCCGGA AAAGATTGCC GTTTCGCGCA TGGGCGTCGA CATGACGCGT TTTACCCATC GTCCGGTGAA AGCGCCAGGG ATGCCGCTGG AGATGATTTC CGTCGCGCGC CTGACCGAGA AAAAAGGCCT GCATGTGGCG ATTGAAGCCT GTCGGCAACT GAAAGCGCAG GGCGTGGCGT TTCGCTACCG CATTCTGGGG ATTGGCCCGT GGGAACGTCG GCTGCGCACG CTCATCGAGC AGTATCAGCT AGAGGATGTC ATTGAGATGC TGGGGTTTAA ACCGAGCCAT GAAGTGAAGG CGATGCTGGA TGACGCCGAT GTTTTTTTGC TGCCGTCGAT TACCGGTACG GATGGCGATA TGGAAGGTAT TCCGGTAGCG CTGATGGAGG CGATGGCGGT AGGGATTCCC GTGGTATCTA CCGTGCATAG CGGTATTCCG GAACTGGTGG AGGCCGGCAA ATCCGGCTGG CTGGTGCCGG AAAACGATGC GCAGGCGCTG GCGGCCCGAC TCGCTGAGTT CAGCCGGATT GACCACGACA CGCTGGAGTC GGTGATCACG CGCGCCCGTG AAAAAGTGGC GCAAGATTTT AATCAGCAGG TGATTAATCG CCAGTTAGCC TGCCTGCTAC AAACGATATA A
|
Protein sequence | MKVSFFLLKF PLSSETFVLN QITAFIDMGH EVEIVALQKG DTQHTHAAWE KYGLAAKTRW LQDEPQGRLA KLRYRACKTL PGLHRAATWK ALNFTRYGDE SRNLILSAIC AQVSHPFVAD VFIAHFGPAG VTAAKLRELG VLRGKIATIF HGIDISSREV LSHYTPEYQQ LFRRGDLMLP ISDLWAGRLK SMGCPPEKIA VSRMGVDMTR FTHRPVKAPG MPLEMISVAR LTEKKGLHVA IEACRQLKAQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMLGFKPSH EVKAMLDDAD VFLLPSITGT DGDMEGIPVA LMEAMAVGIP VVSTVHSGIP ELVEAGKSGW LVPENDAQAL AARLAEFSRI DHDTLESVIT RAREKVAQDF NQQVINRQLA CLLQTI
|
| |