Gene SbBS512_E4054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4054 
SymbolrfaI 
ID6270461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3786948 
End bp3787964 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content38% 
IMG OID641727894 
Productlipopolysaccharide 1,3-galactosyltransferase 
Protein accessionYP_001882326 
Protein GI187732682 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCAATA AGACAATCAT CTTCGATGAA 
AGGCCAGCGG CGTCAGTAGC ATCATCATTC CATGTTGCTT ATGGCATTGA TAAAAACTTT
CTTTTTGGTT GTGGTGTTTC AATCACGTCA GTTTTGTTAC ATAACAACGA CGTGAGTTTT
GTTTTCCACG TTTTTATTGA TGATATCCCT GAAGCCGATA TCCAGCGTTT AGCCCAATTG
GCGAAAAGCT ATCGTACCTG TATCCAGATC CATCTAGTAA ATTGTGAACG GCTTAAGGCA
TTACCGACGA CCAAAAATTG GTCTATTGCC ATGTATTTCC GTTTTGTAAT TGCAGATTAC
TTTATTGATC AACAAGATAA GATCTTTTAC CTGGATGCTG ATATCGCCTG TCAGGGAAAC
TTAAAGCCGC TGATAACAAT GGATCTTGCC AATAACGTTG CTGCTGTTGT TACTGAACGC
GATGCTAACT GGTGGTCGTT ACGGGGTCAA AGTCTGCAGT GTAATGAACT TGAAAAGGGC
TACTTTAATT CAGGTGTCCT GTTAATTAAT ACGCTAGCGT GGGCGCAGGA GTCCGTTTCT
GCTAAAGCGA TGTCGATGCT TGCTGATAAA GCCATCGTTT CCCGTTTCAC CTATATGGAT
CAAGATATCC TTAATCTTAT CCTGTTAGGG AAAGTTAAAT TCATTGATGC TAAATACAAT
ACGCAATTTA GTTTAAATTA TGAATTAAAA AAATCATTTG TTTGTCCAAT TAATGATGAA
ACCGTATTAA TTCATTATGT CGGCCCGACA AAACCCTGGC ATTACTGGGC CGGTTATCCA
AGTGCGCAAC CTTTTATCAA AGCCAAAGAA GCATCGCCCT GGAAAAATGA ACCGTTAATG
CGGCCAGTTA ACTCAAACTA TGCTCGTTAT TGCGCCAAGC ATAATTTTAA ACAAAATAAA
CCAATTAACG GGATAATGAA TTATATTTAT TATTTTTATT TAAAGATAAT AAAATGA
 
Protein sequence
MSAHYFNPQE MINKTIIFDE RPAASVASSF HVAYGIDKNF LFGCGVSITS VLLHNNDVSF 
VFHVFIDDIP EADIQRLAQL AKSYRTCIQI HLVNCERLKA LPTTKNWSIA MYFRFVIADY
FIDQQDKIFY LDADIACQGN LKPLITMDLA NNVAAVVTER DANWWSLRGQ SLQCNELEKG
YFNSGVLLIN TLAWAQESVS AKAMSMLADK AIVSRFTYMD QDILNLILLG KVKFIDAKYN
TQFSLNYELK KSFVCPINDE TVLIHYVGPT KPWHYWAGYP SAQPFIKAKE ASPWKNEPLM
RPVNSNYARY CAKHNFKQNK PINGIMNYIY YFYLKIIK