Gene Information |
Locus tag | EcSMS35_3964 |
Symbol | rfaI |
ID | 6143989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4042147 |
End bp | 4043163 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641618790 |
Product | lipopolysaccharide 1,3-galactosyltransferase |
Protein accession | YP_001745929 |
Protein GI | 170681283 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.185573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCAATA AGACAATCAT CTTCGATGAA AGGCCAGAGG CGTCAGTGGC ATCATCATTC CATGTTGCTT ATGGCATTGA TCAGAACTTT CTTTTTGGTT GTGGTGTTTC AATCACGTCA GTTTTGTTAC ATAACAACGA CGTGAGTTTT GTCTTCCACG TTTTTATTGA TGATATCCCT GACGCCGATA TCCAGCGTTT ATCCCAATTG GCGAAAAGCT ATCATACCTG TATCCAGATC CATCTGGTAA ATTGTGAACG GCTTAAGGTA TTACCGACGA CCAAAAATTG GTCTATTGCC ATGTATTTCC GTTTTGTAAT TGCAGATTAC TTTATTGATC AACAAGATAA GATCCTGTAC CTGGATGCTG ATATCGCCTG TCAGGGAACC TTAAAACCGC TGATAACAAT GGATCTTGCC AATAACGTTG CTGCTGTTGT TACTGAACGC GATGCTAACT GGTGGTCGTT ACGGGCTCAA AGTCTGCAGT GTAATGAACT TGAAAAGGGT TACTTTAATT CAGGTGTCCT GTTAATTAAT ACACCTGCGT GGGCGCAGGA GTCCGTTTCT GCTAAAGCGA TGTCGATGCT TGCTGATAAA GCCGTCGTTT CCCGTTTAAC CTATATGGAT CAAGATATAC TAAATCTTAT CCTGTTAGGG AAAGTTAAAT TCATTGATGC TAAATACAAT ACGCAATTTA GTTTAAATTA TGAATTAAAA AAATCATTTA TTTGTCCAAT TAATGATGAA GCCGTATTAA TTCATTATGT CGGCCCGACA AAACCCTGGC ATTACTGGGC CGGTTATCCA AGCGCGCAGC CATTTATCAA AGCCAAAGAA GCATCGCCCT GGAAAAATGA ACAGTTAATG CGGCCAATTA ACTCAAATTA TGCTCGTTAT TGCGCCAAGC ATAATTTTAA ACAAAATAAA CCAATTAACG GGATAATGAA TTATATTTAT TATTTTTATT TAAAGATAAT AAAATGA
|
Protein sequence | MSAHYFNPQE MINKTIIFDE RPEASVASSF HVAYGIDQNF LFGCGVSITS VLLHNNDVSF VFHVFIDDIP DADIQRLSQL AKSYHTCIQI HLVNCERLKV LPTTKNWSIA MYFRFVIADY FIDQQDKILY LDADIACQGT LKPLITMDLA NNVAAVVTER DANWWSLRAQ SLQCNELEKG YFNSGVLLIN TPAWAQESVS AKAMSMLADK AVVSRLTYMD QDILNLILLG KVKFIDAKYN TQFSLNYELK KSFICPINDE AVLIHYVGPT KPWHYWAGYP SAQPFIKAKE ASPWKNEQLM RPINSNYARY CAKHNFKQNK PINGIMNYIY YFYLKIIK
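
The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TGA). Under those assumptions, 4043163 - 4042147 + 1 = 1017 bp and 1017 / 3 - 1 = 338 aa, which agrees with the values listed. The GC helper ignores the spaces between the 10-base blocks used in the Sequence rows.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(4042147, 4043163)
    print(length)                  # 1017
    print(protein_length(length))  # 338
    # Paste the full "Gene sequence" row here to compare with the GC content field.
    print(gc_percent("ATGAGTGCCC ACTATTTTAA"))
```

Passing the complete Gene sequence row to gc_percent gives a value to compare with the GC content field; the two-block call above only demonstrates the input format.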