Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4043 |
Symbol | |
ID | 6488846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 3926525 |
End bp | 3927538 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642744144 |
Product | lipopolysaccharide 1,3-galactosyltransferase |
Protein accession | YP_002047749 |
Protein GI | 194451361 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.149434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAA AATATTTTGA AGAAGAAGTC ATTCAACAGA CTTTAGATTA TAACTATGCA CAACATAGTG ATGCTGATAA ATTTAATATA GCTTATGGGA TTGATAAAAA CTTTCTTTTT GGCTGTGGTG TCTCTATTGC ATCGGTTCTC CTCGCTAATC CAGAGAAGGC GTTAGCTTTC CATGTTTTTA CCGATTTCTT TGACTCTGAA GACCAGCAGC GATTTGAGGC ATTAGCAAAA CAGTACGCTA CGCAGATTGT TGTTTACCTA ATCGACTGTG AGCGCTTAAA ATCGCTACCC AGTACCAAAA ACTGGACCTA TGCAACATAC TTTAGATTCA TTATCGCCGA TTATTTTTCA GATAAAACAG ATAGAGTACT TTATCTGGAT GCAGATATTG CATGTAAGGG GAGTATTCAG GAACTTATTG ATCTTAATTT TGCTGAAAAT GAGATTGCGG CTGTCGTTGC TGAAGGCGAG TTGGAATGGT GGACTAAGCG CTCGGTTAGC CTGGCAACGC CTGGGCTGGT TTCTGGCTAT TTTAATGCCG GTTTTATTTT AATTAACATA CCTCTTTGGA CTGCAGAAAA TATCTCTAAG AAAGCGATTG AAATGCTAAA AGATCCAGAG GTAGTACAGC GCATAACGCA CCTTGATCAG GATGTATTAA ATATATTTTT AGTGAATAAA GCGCGTTTTG TTGATAAAAA ATTTAATACA CAATTTAGTC TTAACTATGA ATTAAAAGAT TCAGTTATTA ATCCAGTTGA TGCTGAGACT GTATTTGTTC ATTATATCGG ACCAACGAAG CCCTGGCATA GTTGGGGGGC TTACCCCGTG TCACAATATT TTTTACAGGC TAAGAGCAAC TCACCATGGT CTCATTGTGC ACTTTTAAAT CCAGTGACTA GCCATCAGTT ACGTTATGCG GCAAAGCATA TGTTTAATCA GAAGCATTAT ACTTCGGGTA TAAATTATTA TATAGCCTAC TTTAAACGTA AACTTCTTGA ATAA
|
Protein sequence | MSRKYFEEEV IQQTLDYNYA QHSDADKFNI AYGIDKNFLF GCGVSIASVL LANPEKALAF HVFTDFFDSE DQQRFEALAK QYATQIVVYL IDCERLKSLP STKNWTYATY FRFIIADYFS DKTDRVLYLD ADIACKGSIQ ELIDLNFAEN EIAAVVAEGE LEWWTKRSVS LATPGLVSGY FNAGFILINI PLWTAENISK KAIEMLKDPE VVQRITHLDQ DVLNIFLVNK ARFVDKKFNT QFSLNYELKD SVINPVDAET VFVHYIGPTK PWHSWGAYPV SQYFLQAKSN SPWSHCALLN PVTSHQLRYA AKHMFNQKHY TSGINYYIAY FKRKLLE
|
| |