Gene EcSMS35_3958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3958 
SymbolrfaC 
ID6143135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4035974 
End bp4036945 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content51% 
IMG OID641618784 
ProductADP-heptose:LPS heptosyl transferase I 
Protein accessionYP_001745923 
Protein GI170682077 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02193] lipopolysaccharide heptosyltransferase I 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0259947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTTC TGATCGTTAA AACATCGTCG ATGGGCGATG TTCTCCATAC GCTGCCCGCA 
CTCACTGATG CCCAGCAGGC AATCCCAGGG ATTAAGTTTG ACTGGGTGGT GGAAGAAGGG
TTCACACAGA TTCCTTCCTG GCACGCCGCC GTTGAGCGAG TTATTCCTGT GGCAATACGT
CGCTGGCGTA AAGCCTGGTT CTCGGCCCCC ATAAAAGCTG AACGCAAAGC GTTTCGTGAA
ACGCTACAAG CAGAGAATTA TGACGCAGTT ATCGACGCTC AGGGGCTGGT AAAAAGCGCA
GCGCTGGTGA CGCGTCTGGC GCATGGCGTA AAGCATGGAA TGGACTGGCA AACCGCTCGC
GAACCGTTAG CCAGCCTGTT TTACAATCGT AAACATCATA TTGCAAAACA GCAGCACGCC
GTAGAACGCA CCCGCGAACT GTTTGCCAAA AGTCTGGGCT ATAGCAAACC ACAAACCCAG
GGCGATTATG CTATCGCACA GCATTTCCTG ACGAACCTGC CTACAGATGC TGGCGAATAT
GCCGTATTTC TTCATGCAAC AACCCGAGAT GATAAACACT GGCCGGAAGA ACACTGGCGA
GAATTGATTG GTTTACTGGC TGATTCAGGA ATACGGATTA AACTTCCGTG GGGCGCGCCG
CATGAGGAAG AACGGGCGAA ACGACTGGCG GAAGGATTTG CTTATGTTGA AGTATTGCCG
AAGATGAGTC TGGAAGGCGT TGCCCGCGTA CTGGCTGGGG CTAAATTTGT AGTGTCGGTG
GATACGGGGT TAAGCCATTT AACGGCGGCA CTGGATAGAC CCAATATCAC GGTTTATGGA
CCTACCGATC CGGGACTAAT TGGTGGGTAT GGGAAGAATC AGATGGTTTG TAGGGCTCCG
GGGAATGAGT TGTCTCAATT GACAGCAAAT GCTGTTAAGC GGTTCATTGA AGAAAACGCT
GCCATGATTT AA
 
Protein sequence
MRVLIVKTSS MGDVLHTLPA LTDAQQAIPG IKFDWVVEEG FTQIPSWHAA VERVIPVAIR 
RWRKAWFSAP IKAERKAFRE TLQAENYDAV IDAQGLVKSA ALVTRLAHGV KHGMDWQTAR
EPLASLFYNR KHHIAKQQHA VERTRELFAK SLGYSKPQTQ GDYAIAQHFL TNLPTDAGEY
AVFLHATTRD DKHWPEEHWR ELIGLLADSG IRIKLPWGAP HEEERAKRLA EGFAYVEVLP
KMSLEGVARV LAGAKFVVSV DTGLSHLTAA LDRPNITVYG PTDPGLIGGY GKNQMVCRAP
GNELSQLTAN AVKRFIEENA AMI