Gene EcSMS35_3963 details

Gene Information

Locus tag: EcSMS35_3963
Symbol: rfaJ2
ID: 6144753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4041135
End bp: 4042130
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 35%
IMG OID: 641618789
Product: lipopolysaccharide 1,2-glucosyltransferase
Protein accession: YP_001745928
Protein GI: 170679827
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
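
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based with an inclusive end, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Coordinates copied from the record above.
# Assumption: 1-based positions with an inclusive end coordinate.
START_BP = 4041135
END_BP = 4042130


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end positions."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 996 331
```

Both values agree with the Gene Length and Protein Length fields listed in the record.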


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.170676
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGTC 
CCAGAGCTAA ATGTTTCCTA CGGTATCGAT AAGAATTTTT TGTATGGTGC TGGCGTTTCA
ATTTCTTCCG TTTTGATTAA TAATTCAGAT ATTAATTTTG TCTTTCATGT TTTCACTGAT
TATGTGGATG ATGATTATTT AAAGTCATTT AATGAAACAG CAAAACAATT TAATACCTCA
ATTATTGTAT ATTTAATTGA CCCAAAATAC TTTGCTGATC TGCCGACGTC ACAGTTTTGG
TCGTACGCGA CATACTTCAG GGTATTGTCC TTTGAATATC TGAGTGAAAG TATTTCCACA
CTGCTGTATC TGGATGCCGA TGTCGTTTGT AAAGGAAGTC TGAAACCTCT CACAAAAATT
ATATTTAAAG ATGAGTTTGC TGCGGTTATT CCTGACAATG ATAGTACTCA GGCGGCATGT
GCAAAACGTC TTAACATTCC CGAAATGAAT GGACGTTATT TCAATGCAGG CGTTATCTAT
GTCAATCTTA AAAAATGGCA TGAAGCAAAT TTGACACCGT ATTTACTCAC GCTTTTACGA
GGGGAAACTA AATATGGCTC TCTTAAATAT TTAGATCAGG ATGCGTTGAA TATCGCATTT
AATATGAATA ATATCTACCT CGCGAAGGAT TTTGATACTA TTTATACCCT GAAAAACGAA
CTTCATGATC GTAGTCATCG AAAGTATCAG CAAACCATTA CCGATAAAAC AGTATTGATT
CACTATACAG GGATAACTAA ACCATGGCAT AGCTGGGCTG GATATCCGTC TGCATCATAC
TTTAATATCG CGCGTGAACA ATCTCCCTGG AAGAAATATC CTCTTAAAGA GGCGCGGACT
GTTGCAGAAA TGCAGAAACA ATATAAGCAT CTGTTTGCCC ATGGTGAGTA TATAAAAGGC
ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA
 
Protein sequence
MNEFIKERFS YLADNKKENV PELNVSYGID KNFLYGAGVS ISSVLINNSD INFVFHVFTD 
YVDDDYLKSF NETAKQFNTS IIVYLIDPKY FADLPTSQFW SYATYFRVLS FEYLSESIST
LLYLDADVVC KGSLKPLTKI IFKDEFAAVI PDNDSTQAAC AKRLNIPEMN GRYFNAGVIY
VNLKKWHEAN LTPYLLTLLR GETKYGSLKY LDQDALNIAF NMNNIYLAKD FDTIYTLKNE
LHDRSHRKYQ QTITDKTVLI HYTGITKPWH SWAGYPSASY FNIAREQSPW KKYPLKEART
VAEMQKQYKH LFAHGEYIKG ITSLIKYKLK K
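
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the source database. It assumes the sequence is given 5' to 3' in the coding orientation, and it uses the standard codon assignments, which translation table 11 shares for internal codons (the two tables differ in which codons may act as start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. The record lists a GC content of 35%; how the source rounds that figure is not stated, so the example does not assert it.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
# Assumption: these also apply to internal codons under translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, dropping a single trailing stop codon if present."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence shown above.
head = "ATGAATGAAT TTATAAAAGA ACGGTTTTCG"
tail = "ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA"
print(translate(head))  # MNEFIKERFS
print(translate(tail))  # ITSLIKYKLKK
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces with `clean`) gives a complete check, and `gc_fraction` can be applied to the same input to compare against the GC content field.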