Gene EcSMS35_3967 details

Gene Information

Locus tag: EcSMS35_3967
Symbol: rfaQ
ID: 6146256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4045090
End bp: 4046148
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 47%
IMG OID: 641618793
Product: lipopolysaccharide core biosynthesis protein
Protein accession: YP_001745932
Protein GI: 170682581
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02201] lipopolysaccharide heptosyltransferase III, putative
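
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4045090, 4046148)
print(length, protein_length(length))  # prints: 1059 352
```

Both values agree with the Gene Length and Protein Length fields of this record.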


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00179779
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATAAGC CATTTCGAAG AATTTTGCTC ATTAAGATGC GTTTTCATGG GGATATGTTA 
TTAACTACTC CCGTCATTAG TTCGCTGAAA AAAAATTACC CTGACGCAAA AATCGATGTG
CTGCTTTATC AGGACACCAT CCCGATCCTG TCTGAAAATC CAGAGATTAA CGCGCTCTAC
GGCATAAAAA ATAAAAAAGC AAAAGCCTCA GAAAAAATTG CCAACTTTTT CCATCTCATC
AAGATATTAC GTGCCAATAA GTATGACCTT ATCGTTAATC TTACCGATCA ATGGATGGTT
GCTATACTGG TTCGCTTATT AAATGCCCGT GTGAAAATTT CCCAGGATTA TCATCATCGG
CAGTCTGCTT TTTGGCGTAA TAGTTTCACT CATTTGGTGC CGTTGCAGGG TGGAAATGTG
GTGGAAAGTA ACTTATCCGT GCTGACGCCA TTGGGACTTG AATCGTTGGT GAAGCAGACA
ACCATGAGTT ACCCGCCTGC AAGCTGGAAA CGTATGCGTC GCGAACTTGA TCACGCTGGT
GTTGGACAAA ATTATGTGGT TATCCAACCT ACGGCGCGGC AAATCTTCAA ATGCTGGGAC
AACGCCAAGT TTTCCGCTGT GATTGATGCC TTACATGCTC GTGGTTATGA AGTCGTTCTG
ACGTCCGGCC CAGATAAAGA CGATCTGGCC TGCGTCAATG AAATTGCGCA GGGATGCCAG
ACGCCACCAG TAACGGCGCT GGCCGGAAAG GTGACCTTCC CGGAACTTGG TGCGTTAATC
GATCATGCGC AGCTGTTTAT TGGCGTTGAT TCCGCACCGG CGCATATTGC CGCTGCAGTT
AATACGCCGC TGATATCGCT GTTTGGTGCG ACAGACCATA TTTTCTGGCG TCCCTGGTCA
AATAACATGA TTCAATTCTG GGCGGGAGAT TACCGGGAAA TGCCAACGCG CGATCAGCGT
GACCGAAATG AGATGTATCT TTCGGTTATT CCGGCGGCAG ATGTCATTGC AGCTGTCGAT
AAATTACTTC CCTCCTCCAC GACAGGTACG TCGTTATGA
 
Protein sequence
MDKPFRRILL IKMRFHGDML LTTPVISSLK KNYPDAKIDV LLYQDTIPIL SENPEINALY 
GIKNKKAKAS EKIANFFHLI KILRANKYDL IVNLTDQWMV AILVRLLNAR VKISQDYHHR
QSAFWRNSFT HLVPLQGGNV VESNLSVLTP LGLESLVKQT TMSYPPASWK RMRRELDHAG
VGQNYVVIQP TARQIFKCWD NAKFSAVIDA LHARGYEVVL TSGPDKDDLA CVNEIAQGCQ
TPPVTALAGK VTFPELGALI DHAQLFIGVD SAPAHIAAAV NTPLISLFGA TDHIFWRPWS
NNMIQFWAGD YREMPTRDQR DRNEMYLSVI PAADVIAAVD KLLPSSTTGT SL
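
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this; it is an assumption-laden example, not the method used to produce this record. The codon table is built from the standard NCBI amino-acid string, the set of start codons is deliberately reduced to ATG, GTG and TTG, and the name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these initiation codons are recognized.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as Met."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGATAAGC CATTTCGAAG AATTTTGCTC"))  # prints: MDKPFRRILL
```

The output matches the first ten residues of the protein sequence listed above.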