Gene Information |
Locus tag | EcHS_A3841 |
Symbol | rfaQ |
ID | 5593299 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3836650 |
End bp | 3837708 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640922953 |
Product | lipopolysaccharide core biosynthesis protein |
Protein accession | YP_001460431 |
Protein GI | 157163113 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR02201] lipopolysaccharide heptosyltransferase III, putative |
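The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3836650, 3837708)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Both results agree with the Gene Length and Protein Length fields, which suggests the record's coordinates follow the inclusive convention assumed here.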
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00000000196072 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAAGC CATTTCGAAG AATTTTGCTC ATTAAGATGC GTTTTCATGG GGATATGTTA TTAACTACTC CCGTCATTAG TTCGCTGAAA AAAAATTACC CTGACGCAAA AATCGATGTG CTGCTTTATC AGGACACCAT CCCGATCCTG TCTGAAAATC CAGAGATTAA CGCGCTCTAC GGCATAAAAA ATAAAAAAGC AAAAGCCTCA GAAAAAATTG CCAACTTTTT TCATCTCATC AAGGTATTAC GTGCCAATAA GTATGACCTT ATCGTCAATC TTACCGATCA ATGGATGGTT GCTATACTGG TTCGCTTATT AAATGCTCGT GTGAAAATTT CCCAGGATTA TCATCATCGG CAGTCTGCTT TTTGGCGTAA AAGTTTCACC CATTTGGTGC CGTTGCAGGG TGGAAATGTG GTGGAAAGTA ACTTATCCGT GCTGACACCA TTGGGACTTG ATTCGTTGGT GAAGCAGACA ACCATGAGTT ACCCGCCTGC AAGCTGGAAA CGTATGCGTC GCGAACTTGA TCACGCTGGT GTTGGACAAA ATTATGTGGT TATCCAACCT ACGGCGCGGC AAATCTTCAA ATGCTGGGAC AACGCCAAGT TTTCCGCTGT GATTGATGCC TTACATGCTC GTGGTTATGA AGTCGTTCTG ACGTCCGGCC CGGATAAAGA CGATCTGGCC TGCGTCAATG AAATTGCGCA GGGATGCCAG ACGCCACCAG TAACGGCGCT GGCTGGAAAG GTGACCTTCC CGGAACTTGG TGCGTTAATC GATCATGCGC AGCTGTTTAT TGGCGTTGAT TCCGCACCGG CGCATATTGC CGCTGCAGTT AATACGCCGC TGATATCGCT GTTTGGTGCG ACAGACCATA TTTTCTGGCG TCCCTGGTCA AATAACATGA TTCAATTCTG GGCGGGAGAT TACCGGGAAA TGCCAACGCG CGATCAGCGT GACCGAAATG AGATGTATCT TTCGGTTATT CCGGCGGCAG ATGTCATTGC TGCTGTCGAT AAATTACTGC CCTCCTCCAC GACAGGTACG TCGTTATGA
|
Protein sequence | MDKPFRRILL IKMRFHGDML LTTPVISSLK KNYPDAKIDV LLYQDTIPIL SENPEINALY GIKNKKAKAS EKIANFFHLI KVLRANKYDL IVNLTDQWMV AILVRLLNAR VKISQDYHHR QSAFWRKSFT HLVPLQGGNV VESNLSVLTP LGLDSLVKQT TMSYPPASWK RMRRELDHAG VGQNYVVIQP TARQIFKCWD NAKFSAVIDA LHARGYEVVL TSGPDKDDLA CVNEIAQGCQ TPPVTALAGK VTFPELGALI DHAQLFIGVD SAPAHIAAAV NTPLISLFGA TDHIFWRPWS NNMIQFWAGD YREMPTRDQR DRNEMYLSVI PAADVIAAVD KLLPSSTTGT SL
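The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and how a GC fraction could be computed for comparison with the GC content field. The helper names are hypothetical, and the short example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, used only as a small example.
fragment = clean_sequence("GTGGATAAGC CATTTCGAAG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content listed in the record.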