Gene Information |
Locus tag | EcSMS35_3963 |
Symbol | rfaJ2 |
ID | 6144753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4041135 |
End bp | 4042130 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641618789 |
Product | lipopolysaccharide 1,2-glucosyltransferase |
Protein accession | YP_001745928 |
Protein GI | 170679827 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
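The coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, not part of the original record: the function and variable names are illustrative, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table above.
start_bp = 4041135
end_bp = 4042130


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, assuming one stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (996 bp) and Protein Length (331 aa) rows.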
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.170676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGTC CCAGAGCTAA ATGTTTCCTA CGGTATCGAT AAGAATTTTT TGTATGGTGC TGGCGTTTCA ATTTCTTCCG TTTTGATTAA TAATTCAGAT ATTAATTTTG TCTTTCATGT TTTCACTGAT TATGTGGATG ATGATTATTT AAAGTCATTT AATGAAACAG CAAAACAATT TAATACCTCA ATTATTGTAT ATTTAATTGA CCCAAAATAC TTTGCTGATC TGCCGACGTC ACAGTTTTGG TCGTACGCGA CATACTTCAG GGTATTGTCC TTTGAATATC TGAGTGAAAG TATTTCCACA CTGCTGTATC TGGATGCCGA TGTCGTTTGT AAAGGAAGTC TGAAACCTCT CACAAAAATT ATATTTAAAG ATGAGTTTGC TGCGGTTATT CCTGACAATG ATAGTACTCA GGCGGCATGT GCAAAACGTC TTAACATTCC CGAAATGAAT GGACGTTATT TCAATGCAGG CGTTATCTAT GTCAATCTTA AAAAATGGCA TGAAGCAAAT TTGACACCGT ATTTACTCAC GCTTTTACGA GGGGAAACTA AATATGGCTC TCTTAAATAT TTAGATCAGG ATGCGTTGAA TATCGCATTT AATATGAATA ATATCTACCT CGCGAAGGAT TTTGATACTA TTTATACCCT GAAAAACGAA CTTCATGATC GTAGTCATCG AAAGTATCAG CAAACCATTA CCGATAAAAC AGTATTGATT CACTATACAG GGATAACTAA ACCATGGCAT AGCTGGGCTG GATATCCGTC TGCATCATAC TTTAATATCG CGCGTGAACA ATCTCCCTGG AAGAAATATC CTCTTAAAGA GGCGCGGACT GTTGCAGAAA TGCAGAAACA ATATAAGCAT CTGTTTGCCC ATGGTGAGTA TATAAAAGGC ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA
|
Protein sequence | MNEFIKERFS YLADNKKENV PELNVSYGID KNFLYGAGVS ISSVLINNSD INFVFHVFTD YVDDDYLKSF NETAKQFNTS IIVYLIDPKY FADLPTSQFW SYATYFRVLS FEYLSESIST LLYLDADVVC KGSLKPLTKI IFKDEFAAVI PDNDSTQAAC AKRLNIPEMN GRYFNAGVIY VNLKKWHEAN LTPYLLTLLR GETKYGSLKY LDQDALNIAF NMNNIYLAKD FDTIYTLKNE LHDRSHRKYQ QTITDKTVLI HYTGITKPWH SWAGYPSASY FNIAREQSPW KKYPLKEART VAEMQKQYKH LFAHGEYIKG ITSLIKYKLK K
|
| |
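The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. As a hedged illustration, the Python sketch below strips the whitespace and computes a GC fraction; the helper names are hypothetical, and only the first 20 bases of the gene sequence are used as sample input, so the printed value describes that prefix and not the whole gene.

```python
def clean_sequence(raw):
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, copied from the record above.
prefix = "ATGAATGAAT TTATAAAAGA"
print(len(clean_sequence(prefix)), gc_fraction(prefix))
```

Passing the full Gene sequence string to `gc_fraction` would give a value to compare against the GC content row.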