Gene Information |
Locus tag | EcHS_A3837 |
Symbol | rfaJ2 |
ID | 5592840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3832695 |
End bp | 3833690 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640922949 |
Product | lipopolysaccharide 1,2-glucosyltransferase |
Protein accession | YP_001460427 |
Protein GI | 157163109 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
| |
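The gene length and protein length above follow from the coordinates by simple arithmetic. A minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are hypothetical, chosen for illustration only):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3832695
END_BP = 3833690


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "996 331"
```

Both values agree with the Gene Length and Protein Length rows of the record.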
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0000031402 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGCC CCAGAGCTAA ATGTTTCCTA CGGTATCGAT AAGAATTTTT TGTATGGTGC TGGCGTTTCA ATTTCTTCCG TTTTGATTAA TAATTCAGAT ATTAATTTTG TCTTTCATGT TTTCACTGAT TATGTGGATG ATAATTATTT AAAGTCATTT AATGAAACAG CAAAACAATT TAATACCTCA ATTATTGTAT ATTTAATTGA CCCCAAATAC TTTGCTGATC TGCCGACGTC ACAGTTTTGG TCGTACGCGA CATACTTCAG GGTATTGTCT TTTGAATATC TGAGTGAAAG TATTTCCACA CTGCTGTATC TGGATGCCGA TGTTGTTTGT AAAGGAAGCC TGAAACCTCT CACAGAAATT ATATTTAAAG ATGAGTTTGC TGCGGTCATT CCTGACAATG ATAGTACTCA GGCGGCATGT GCAAAACGCC TCAACATTCC CGAAATGAAT GGACGTTATT TCAATGCAGG CGTTATCTAT GTCAATCTTA AAAAATGGCA TGAAGCAAAT TTGACACCGT ATTTACTCAC GCTTTTACGA GGGGAAACTA AATATGGCTC TCTTAAATAT TTAGATCAGG ATGCGTTGAA TATCGCATTT AATATGAATA ATATCTACCT CGCGAAGGAT TTTGATACTA TTTATACCCT GAAAAACGAA CTTCATGATC GTAGTCATCG AAAGTATCAG CAAACCATTA CTGATAAAAC AGTATTGATT CACTATACAG GGATAACTAA ACCATGGCAT AGCTGGGCTG GATATCCGTC TGCATCATAC TTTAATATCG CGCGTGAACA ATCTCCCTGG AAGAAATATC CTCTTAAAGA GGCGCGGACT GTTGCAGAAA TGCAGAAACA ATATAAGCAT CTGTTTGCCC ATGGTGAGTA TATTAAAGGT ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA
|
Protein sequence | MNEFIKERFS YLADNKKENA PELNVSYGID KNFLYGAGVS ISSVLINNSD INFVFHVFTD YVDDNYLKSF NETAKQFNTS IIVYLIDPKY FADLPTSQFW SYATYFRVLS FEYLSESIST LLYLDADVVC KGSLKPLTEI IFKDEFAAVI PDNDSTQAAC AKRLNIPEMN GRYFNAGVIY VNLKKWHEAN LTPYLLTLLR GETKYGSLKY LDQDALNIAF NMNNIYLAKD FDTIYTLKNE LHDRSHRKYQ QTITDKTVLI HYTGITKPWH SWAGYPSASY FNIAREQSPW KKYPLKEART VAEMQKQYKH LFAHGEYIKG ITSLIKYKLK K
|
| |
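The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence, so the Protein sequence row can be checked against the Gene sequence row. It is an illustration under stated assumptions, not the database's own pipeline: the helper names are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which start codons it permits). Although the gene lies on the minus strand, the listed sequence already reads in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is applied.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the ten-character grouping (any whitespace) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence from the record.
prefix = "ATGAATGAAT TTATAAAAGA ACGGTTTTCG"
print(translate(prefix))  # prints "MNEFIKERFS"
```

To check the whole record, pass the full Gene sequence value to `translate` and compare the result with `clean` applied to the Protein sequence value; `gc_fraction` on the same input can be compared with the GC content row.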