Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3998 |
Symbol | |
ID | 6486384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3885952 |
End bp | 3886965 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642739258 |
Product | lipopolysaccharide 1,3-galactosyltransferase |
Protein accession | YP_002042968 |
Protein GI | 194442386 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.313383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAA AATATTTTGA AGAAGAAGTC ATTCAACAGA CTTTAGATTA TAACTATGCA CAACATAGTG ATGCTGATAA ATTTAATATA GCTTATGGGA TTGATAAAAA CTTTCTTTTT GGCTGTGGTG TCTCTATTGC ATCGGTTCTC CTCGCTAACC CAGAGAAGGC GTTAGCTTTC CATGTTTTTA CCGATTTCTT TGACTCTGAA GACCAGCAAC GATTTGAGGC ATTAGCAAAA CAGTACGCTA CGCAGATCGT TGTTTACCTA ATCGACTGTG AGCGCTTAAA ATCGCTACCC AGTACCAAAA ACTGGACCTA TGCAACATAC TTTAGATTCA TTATCGCCGA TTATTTTTCA GATAAAACAG ATAGAGTACT TTATCTGGAT GCAGATATTG CATGTAAAGG GAGTATTCAA GAACTTATTG ATCTTAATTT CGCTGAAAAT GAGATTGCGG CGGTTGTTGC TGAAGGCGAG TTGGAGTGGT GGACTAAGCG CTCGGTTAGC CTGGCAACGC CTGGGCTGGT TTCTGGCTAT TTTAATGCCG GTTTTATTTT AATTAACATA CCTCTTTGGA CCGCAGAAAA TATCTCTAAG AAAGCGATTG AAATGCTAAA AGATCCGGAG GTAGTACAGC GCATAACGCA CCTTGATCAG GATGTATTAA ATATATTGTT AGTGAATAAA GCTCGTTTTG TTGATAAAAA ATTTAATACA CAATTTAGTC TTAACTATGA ATTAAAAGAT TCAGTTATTA ATCCAGTCGA TGCTGAGACT GTATTTGTTC ATTATATCGG ACCAACGAAG CCCTGGCATA GTTGGGGGGC TTACCCTGTG TCACAATATT TTTTACAGGC TAAGAGCAAC TCACCATGGT CTCATTGTGC ACTTTTAAAT CCAGTGACTA GCCATCAGTT ACGTTATGCG GCAAAGCATA TGTTTAATCA GAAGCATTAT ACTTCGGGTA TAAATTATTA TATAGCCTAC TTTAAACGTA AACTTCTTGA ATAA
|
Protein sequence | MSRKYFEEEV IQQTLDYNYA QHSDADKFNI AYGIDKNFLF GCGVSIASVL LANPEKALAF HVFTDFFDSE DQQRFEALAK QYATQIVVYL IDCERLKSLP STKNWTYATY FRFIIADYFS DKTDRVLYLD ADIACKGSIQ ELIDLNFAEN EIAAVVAEGE LEWWTKRSVS LATPGLVSGY FNAGFILINI PLWTAENISK KAIEMLKDPE VVQRITHLDQ DVLNILLVNK ARFVDKKFNT QFSLNYELKD SVINPVDAET VFVHYIGPTK PWHSWGAYPV SQYFLQAKSN SPWSHCALLN PVTSHQLRYA AKHMFNQKHY TSGINYYIAY FKRKLLE
|
| |