Gene SbBS512_E4058 details


Gene Information

Locus tag: SbBS512_E4058
Symbol: waaA
ID: 6269421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3791362
End bp: 3792639
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 54%
IMG OID: 641727898
Product: 3-deoxy-D-manno-octulosonic-acid transferase
Protein accession: YP_001882330
Protein GI: 187730590
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase
TIGRFAM ID:
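
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and that the CDS length includes one stop codon, neither of which the record states explicitly, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3791362, 3792639)
print(length, protein_length_aa(length))
```

With the values from this record, the two functions give 1278 bp and 425 aa, matching the Gene Length and Protein Length fields.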


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.000579677
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG 
CTCTGGGTGC GCGGACGTAA GGCTCCGGCC TATCGAAAAC GCTGGGGTGA ACGTTACGGT
TTTTACCGCC ATCCGCTAAA ACCAGGCGGC ATTATGCTGC ACTCCGTCTC CGTCGGTGAA
ACTCTGGCGG TGATCCCGTT GGTGCGCGCG CTGCGTCATC GTTATCCTGA TTTACCGATT
ACCGTTACAA CCATGACGCC AACCGGTTCG GAGCGCGTAC AATCGGCTTT CGGGAAGGAT
GTTCAGCACG TTTATCTGCC GTATGACCTG CCCGATGCAC TTAATCGTTT CCTGAATAAA
GTCGACCCTA AACTGGTGTT GATTATGGAA ACCGAACTAT GGCCTAACCT GATTGCGGCG
CTACATAAAC GTAAAATTCC GCTGGTGATC GCGAACGCGC GACTCTCTGC CCGCTCGGCC
GCAGGTTATG CCAAACTGGG TAAATTCGTC CGTCGCTTGC TGCGTCGTAT TACGCTGATT
GCCGCCCAAA ATGAAGAAGA TGGTGCACGT TTTGTGGCGC TGGGCGCAAA AAATAACCAG
GTAACCGTTA CCGGTAGCCT GAAATTCGAT ATTTCTGTAA CGCCGCAGTT GGCTGCTAAA
GCCGTGACGC TGCGCCGCCA GTGGGCACCA CACCGCCCGG TATGGATTGC CACCAGCACT
CACGAAGGTG AAGAGAGTGT GGTGATCGCC GCACATCAGG TATTGTTACA GCAATTCCCG
AATTTATTGC TCATCCTGGT ACCCCGTCAT CCAGAACGCT TCCCGGATGC GATTAACCTT
GTCCGCCAGG CTGGACTAAG CTATATCACA CGCTCTTCAG GGGAAGTCCC CTCCACCAGC
ACGCAGGTTG TGGTTGGCGA TACGATGGGC GAGTTGATGT TACTGTACGG CATTGCCGAT
CTCGCCTTTG TTGGCGGTTC ACTGGTTGAA CGTGGTGGGC ATAATCCGCT GGAAGCTGCC
GCACACGCTA TTCCGGTATT GATGGGGCCG CATACTTTTA ATTTTAAAGA CATTTGCGCG
CGGCTGGAGC AGGCAAGCGG GCTGATTACC GTTACCGATG CCACTACGCT TGCAAAAGAG
GTTTCCTCTT TACTCACCGA CGCCGATTAC CGTAGTTTCT ATGGCCGTCA TGCCGTTGAA
GTACTGTATC AAAACCAGGG CGCGCTACAG CGCCTGCTTC AACTGCTGGA ACCTTACCTG
CCACCGAAAA CGCATTGA
 
Protein sequence
MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRHPLKPGG IMLHSVSVGE 
TLAVIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGKD VQHVYLPYDL PDALNRFLNK
VDPKLVLIME TELWPNLIAA LHKRKIPLVI ANARLSARSA AGYAKLGKFV RRLLRRITLI
AAQNEEDGAR FVALGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST
HEGEESVVIA AHQVLLQQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSTS
TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA
RLEQASGLIT VTDATTLAKE VSSLLTDADY RSFYGRHAVE VLYQNQGALQ RLLQLLEPYL
PPKTH
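
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be used. The following is a minimal sketch, not part of the record: it strips the blocking, computes the G+C fraction, and translates a coding sequence using the amino acid assignments of NCBI translation table 11 (the table listed for this gene), which are the same as those of the standard code. The function names are hypothetical, and the special handling of alternative start codons that table 11 allows is not implemented.

```python
import itertools

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
print(translate("ATGCTCGAAT TGCTTTACAC"))
```

The first six codons of the gene sequence translate to MLELLY, which agrees with the start of the protein sequence listed above.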