Gene EcSMS35_3956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3956 
SymbolrfaD 
ID6147033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4033982 
End bp4034914 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content51% 
IMG OID641618782 
ProductADP-L-glycero-D-mannoheptose-6-epimerase 
Protein accessionYP_001745921 
Protein GI170683119 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000259209 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATCG TTACCGGCGG CGCGGGCTTT ATCGGCAGCA ACATCGTTAA AGCCCTGAAT 
GACAAAGGCA TCACCGATAT TCTGGTGGTG GATAACCTGA AAGACGGCAC CAAGTTTGTG
AACCTGGTGG ATCTGGATAT CGCGGACTAT ATGGATAAGG AAGACTTCCT GATCCAGATT
ATGGCTGGCG AAGAGTTCGG CGATGTCGAA GCGATTTTCC ACGAAGGCGC GTGCTCTTCC
ACCACCGAGT GGGACGGCAA GTATATGATG GATAACAACT ATCAATACTC CAAAGAGCTG
CTGCACTACT GCCTGGAGCG CGAAATCCCG TTGCTGTACG CTTCTTCTGC AGCCACTTAC
GGCGGACGCA CTTCCGACTT TATTGAATCC CGCGAGTACG AAAAACCACT TAACGTCTAC
GGTTATTCGA AGTTCCTGTT TGATGAATAT GTTCGCCAAA TCCTGCCGGA AGCCAACTCG
CAGATTGTTG GCTTCCGTTA TTTCAACGTT TATGGACCGC GTGAAGGCCA TAAAGGCAGC
ATGGCGAGCG TCGCTTTCCA TCTCAACACC CAGCTTAATA ACGGTGAATC ACCGAAGCTG
TTCGAAGGTA GCGAGAACTT CAAACGTGAC TTCGTCTATG TGGGCGATGT GGCAGATGTA
AACCTGTGGT TCCTGGAAAA TGGCGTTTCC GGCATCTTCA ACCTCGGCAC CGGTCGTGCG
GAATCCTTCC AGGCAGTAGC AGATGCTACA CTTGCTTATC ACAAGAAAGG CCAAATCGAA
TACATTCCGT TCCCGGATAA ACTGAAAGGC CGCTACCAGG CGTTCACGCA GGCAGATCTG
ACGAATCTGC GCGCGGCGGG TTACGATAAA CCGTTCAAAA CCGTTGCCGA AGGCGTAACG
GAATACATGG CCTGGCTGAA TCGTGACGCA TAA
 
Protein sequence
MIIVTGGAGF IGSNIVKALN DKGITDILVV DNLKDGTKFV NLVDLDIADY MDKEDFLIQI 
MAGEEFGDVE AIFHEGACSS TTEWDGKYMM DNNYQYSKEL LHYCLEREIP LLYASSAATY
GGRTSDFIES REYEKPLNVY GYSKFLFDEY VRQILPEANS QIVGFRYFNV YGPREGHKGS
MASVAFHLNT QLNNGESPKL FEGSENFKRD FVYVGDVADV NLWFLENGVS GIFNLGTGRA
ESFQAVADAT LAYHKKGQIE YIPFPDKLKG RYQAFTQADL TNLRAAGYDK PFKTVAEGVT
EYMAWLNRDA