Gene EcSMS35_3968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3968 
SymbolwaaA 
ID6146786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4046561 
End bp4047838 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content54% 
IMG OID641618794 
Product3-deoxy-D-manno-octulosonic-acid transferase 
Protein accessionYP_001745933 
Protein GI170682133 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00405432 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG 
CTCTGGGTGC GCGGACGTAA GGCTCCGGCC TATCGAAAAC GCTGGGGTGA ACGTTACGGT
TTTTACCGCC ATCCGCTAAA ACCAGGCGGC ATTATGCTGC ACTCCGTCTC CGTCGGTGAA
ACTCTGGCGG CGATCCCGTT GGTGCGCGCG CTGCGCCATC GTTATCCTGA TTTACCGATT
ACCGTTACAA CCATGACGCC AACCGGTTCG GAGCGCGTAC AATCGGCTTT CGGGAAGGAT
GTTCAGCACG TTTATCTGCC GTACGACCTG CCCGATGCGC TTAATCGTTT CCTGAATAAA
GTCGACCCTA AACTGGTGTT GATTATGGAA ACCGAACTAT GGCCTAACCT GATTGCGGCG
CTACATAAAC GTAAAATTCC GCTGGTGATC GCTAACGCGA GACTCTCTGC CCGCTCGGCC
GCAGGTTATG CCAAACTGGG TAAATTCGTC CGTCGCTTGC TGCGTCGTAT TACGCTGATT
GCTGCGCAAA ATGAAGAAGA TGGTGCACGT TTTGTATCAC TGGGCGCAAA AAATAATCAG
GTGACCGTCA CCGGTAGCCT GAAATTCGAT ATTTCTGTAA CGCCGCAACT GGCTGCTAAA
GCAGTGACGC TGCGCCGCCA GTGGGCACCA CACCGCCCGG TATGGATTGC CACCAGCACT
CACGAAGGCG AAGAGAGTGT GGTGATCGCC GCACATCAGG CATTGTTACA GCAATTCCCG
AATTTATTGC TCATCCTGGT ACCCCGTCAT CCGGAACGCT TCCCGGATGC GATTAACCTT
GTCCGCCAGG CTGGACTAAG CTATATCACA CGATCTTCAG GGGAAGTCCC CTCCACCAGC
ACGCAGGTTG TGGTTGGCGA TACGATGGGC GAGTTGATGT TACTGTACGG CATTGCCGAT
CTCGCCTTTG TTGGCGGTTC ACTGGTTGAA CGTGGTGGGC ATAATCCGCT GGAAGCGGCC
GCACACGCTA TTCCGGTATT GATGGGGCCG CATACGTTTA ACTTTAAAGA CATTTGCGCG
CGGCTGGAGC AGGCAAGCGG GCTGATTACC GTTACCGATG CCACTACGCT TGCAAAAGAG
GTTTCCTCTT TACTCACCGA CGCCGATTAC CGTAGTTTCT ATGGCCGTCA TGCCGTTGAA
GTGCTGTATC AAAACCAGGG CGCACTACAG CGTCTGCTTC AACTGCTGGA ACCTTACCTG
CCACCGAAAA CGCATTGA
 
Protein sequence
MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRHPLKPGG IMLHSVSVGE 
TLAAIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGKD VQHVYLPYDL PDALNRFLNK
VDPKLVLIME TELWPNLIAA LHKRKIPLVI ANARLSARSA AGYAKLGKFV RRLLRRITLI
AAQNEEDGAR FVSLGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST
HEGEESVVIA AHQALLQQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSTS
TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA
RLEQASGLIT VTDATTLAKE VSSLLTDADY RSFYGRHAVE VLYQNQGALQ RLLQLLEPYL
PPKTH