Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4057 |
Symbol | rfaQ |
ID | 6269922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3789891 |
End bp | 3790949 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641727897 |
Product | lipopolysaccharide core biosynthesis protein |
Protein accession | YP_001882329 |
Protein GI | 187733694 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR02201] lipopolysaccharide heptosyltransferase III, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.245519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAAGC TATTTCGAAG AATTTTGCTC ATTAAGATGC GTTTTCATGG GGATATGTTA TTAACTACTC CCGTCATTAG TTCGCTGAAA AAAAATTACC CTGACGCAAA AATCGATGTG CTGCTTTATC AGGACACCAT CCCGATCCTG TCTGAAAATC CAGAGATTAA CGCGCTCTAC GGCATAAAAA ATAAAAAAGC AAAAGCCTCA GAAAAAATTG CCAACTTTTT TCATCTCATC AAGGTATTAC GTGCCAATAA GTATGACCTT ATCGTCAATC TCACCGATCA ATGGATGGTT GCTATACTGG TTCGCTTATT AAATGCCCGT GTGAAAATTT CCCAGGATTA TCATCATCGG CAGTCTGCTT TTTGGCGTAA AAGTTTCACC CATTTGGTGC CGTTGCAGGG TGGAAATGTG GTGGAAAGTA ACTTATCCGT GCTGACCCCA TTGGGAGTTG ATTCGTTGGT GAAGCAGACA ACCATGAGTT ACCCGCCTGC AAGCTGGAAA CGTATGCGTC GCGAACTTGA TCACGCTGGT GTTGGACAAA ATTATGTGGT TATCCAACCT ACGGCGCGGC AAATCTTCAA ATGCTGGGAC AACGCCAAGT TTTCCGCTGT GATTGATGCC TTACATGTTC GTGGTTATGA AGTTGTTCTG ACGTCCGGCC CAGATAAAGA CGATCTGGCC TGCGTCAATG AAATTGCGCA GGGATGCCAG ACGCCACCAG TAACGGCGCT GGCTGGAAAG GTGACCTTCC CGGAACTTGG TGCGTTAATC GATCATGCGC AGCTGTTTAT TGGCGTTGAT TCCGCACCGG CGCATATTGC CGCTGCAGTT AATACGCCGC TGATATCGCT GTTTGGTGCG ACAGACCATA TTTTCTGGCG TCCCTGGTCA AATAACATGA TTCAATTCTG GGCGGGAGAT TACCGGGAAA TGCCAACGCG CGATCAGCGT GACCGAAATG AGATGTATCT TTCGGTTATT CCGGCGGCAG ATGTCATTGC TGCTGTCGAT AAATTACTGC CCTCCTCCAC GACAGGTACG TCGTTATGA
|
Protein sequence | MDKLFRRILL IKMRFHGDML LTTPVISSLK KNYPDAKIDV LLYQDTIPIL SENPEINALY GIKNKKAKAS EKIANFFHLI KVLRANKYDL IVNLTDQWMV AILVRLLNAR VKISQDYHHR QSAFWRKSFT HLVPLQGGNV VESNLSVLTP LGVDSLVKQT TMSYPPASWK RMRRELDHAG VGQNYVVIQP TARQIFKCWD NAKFSAVIDA LHVRGYEVVL TSGPDKDDLA CVNEIAQGCQ TPPVTALAGK VTFPELGALI DHAQLFIGVD SAPAHIAAAV NTPLISLFGA TDHIFWRPWS NNMIQFWAGD YREMPTRDQR DRNEMYLSVI PAADVIAAVD KLLPSSTTGT SL
|
| |