Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4054 |
Symbol | rfaI |
ID | 6270461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3786948 |
End bp | 3787964 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641727894 |
Product | lipopolysaccharide 1,3-galactosyltransferase |
Protein accession | YP_001882326 |
Protein GI | 187732682 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCAATA AGACAATCAT CTTCGATGAA AGGCCAGCGG CGTCAGTAGC ATCATCATTC CATGTTGCTT ATGGCATTGA TAAAAACTTT CTTTTTGGTT GTGGTGTTTC AATCACGTCA GTTTTGTTAC ATAACAACGA CGTGAGTTTT GTTTTCCACG TTTTTATTGA TGATATCCCT GAAGCCGATA TCCAGCGTTT AGCCCAATTG GCGAAAAGCT ATCGTACCTG TATCCAGATC CATCTAGTAA ATTGTGAACG GCTTAAGGCA TTACCGACGA CCAAAAATTG GTCTATTGCC ATGTATTTCC GTTTTGTAAT TGCAGATTAC TTTATTGATC AACAAGATAA GATCTTTTAC CTGGATGCTG ATATCGCCTG TCAGGGAAAC TTAAAGCCGC TGATAACAAT GGATCTTGCC AATAACGTTG CTGCTGTTGT TACTGAACGC GATGCTAACT GGTGGTCGTT ACGGGGTCAA AGTCTGCAGT GTAATGAACT TGAAAAGGGC TACTTTAATT CAGGTGTCCT GTTAATTAAT ACGCTAGCGT GGGCGCAGGA GTCCGTTTCT GCTAAAGCGA TGTCGATGCT TGCTGATAAA GCCATCGTTT CCCGTTTCAC CTATATGGAT CAAGATATCC TTAATCTTAT CCTGTTAGGG AAAGTTAAAT TCATTGATGC TAAATACAAT ACGCAATTTA GTTTAAATTA TGAATTAAAA AAATCATTTG TTTGTCCAAT TAATGATGAA ACCGTATTAA TTCATTATGT CGGCCCGACA AAACCCTGGC ATTACTGGGC CGGTTATCCA AGTGCGCAAC CTTTTATCAA AGCCAAAGAA GCATCGCCCT GGAAAAATGA ACCGTTAATG CGGCCAGTTA ACTCAAACTA TGCTCGTTAT TGCGCCAAGC ATAATTTTAA ACAAAATAAA CCAATTAACG GGATAATGAA TTATATTTAT TATTTTTATT TAAAGATAAT AAAATGA
|
Protein sequence | MSAHYFNPQE MINKTIIFDE RPAASVASSF HVAYGIDKNF LFGCGVSITS VLLHNNDVSF VFHVFIDDIP EADIQRLAQL AKSYRTCIQI HLVNCERLKA LPTTKNWSIA MYFRFVIADY FIDQQDKIFY LDADIACQGN LKPLITMDLA NNVAAVVTER DANWWSLRGQ SLQCNELEKG YFNSGVLLIN TLAWAQESVS AKAMSMLADK AIVSRFTYMD QDILNLILLG KVKFIDAKYN TQFSLNYELK KSFVCPINDE TVLIHYVGPT KPWHYWAGYP SAQPFIKAKE ASPWKNEPLM RPVNSNYARY CAKHNFKQNK PINGIMNYIY YFYLKIIK
|
| |