Gene EcE24377A_4129 details

Gene Information

Locus tag: EcE24377A_4129
Symbol: rfaJ
ID: 5586431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4119050
End bp: 4120045
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 35%
IMG OID: 640927748
Product: lipopolysaccharide 1,2-glucosyltransferase
Protein accession: YP_001465108
Protein GI: 157154988
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
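
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows; it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 4119050
end_bp = 4120045

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 996 bp
print(protein_length, "aa")  # 331 aa
```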


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0229706
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGCC 
CCAGAGCTAA ATGTTTCCTA CGGTATCGAT AAGAATTTTT TGTATGGTGC TGGCGTTTCA
ATTTCTTCCG TTTTGATTAA TAATTCAGAT ATTAATTTTG TCTTTCATGT TTTCACTGAT
TATGTGGATG ATGATTATTT AAAGTCATTT AATGAAACAG CAAAACAATT TAATACCTCA
ATTATTGTAT ATTTAATTGA CCCCAAATAC TTTGCTGATC TGCCGACGTC ACAGTTTTGG
TCGTACGCGA CATACTTCAG GGTATTGTCT TTTGAATATC TGAGTGAAAG TATTTCCACA
CTGCTGTATC TGGATGCCGA TGTTGTTTGT AAAGGAAGCC TGAAACCTCT CACAGAAATT
ATATTTAAAG ATGAGTTTGC TGCGGTCATT CCTGACAATG ATAGTACTCA GGCGGCATGT
GCAAAACGCC TCAACATTCC CGAAATGAAT GGACGTTATT TCAATGCAGG CGTTATCTAT
GTCAATCTTA AAAAATGGCA TGAAGCAAAT TTGACACCGT ATTTACTCAA ACTTTTACGA
GGGGAAACTA AATATGGCTC TCTTAAATAT TTAGATCAGG ATGCGTTGAA TATCGCATTT
AATATGAATA ATATCTACCT TGCGAAGGAT TTTGATACTA TTTATACCCT GAAAAACGAA
CTTTATGATC GTAGTCATCG AAAGTATCAG CAAACCATTA CCGATAAAAC AGTGTTGATT
CACTATACAG GGATAACTAA ACCATGGCAT AGCTGGGCTG GATATCCGTC TGCATCATAC
TTTAATATCG CGCGTGAACA ATCTCCCTGG AAGAAATATC CTCTTAAAGA GGCGCGGACT
GTTGCAGAAA TGCAGAAACA ATATAAGCAT CTGTTTGCCC ATGGTGAGTA TATAAAAGGC
ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA
 
Protein sequence
MNEFIKERFS YLADNKKENA PELNVSYGID KNFLYGAGVS ISSVLINNSD INFVFHVFTD 
YVDDDYLKSF NETAKQFNTS IIVYLIDPKY FADLPTSQFW SYATYFRVLS FEYLSESIST
LLYLDADVVC KGSLKPLTEI IFKDEFAAVI PDNDSTQAAC AKRLNIPEMN GRYFNAGVIY
VNLKKWHEAN LTPYLLKLLR GETKYGSLKY LDQDALNIAF NMNNIYLAKD FDTIYTLKNE
LYDRSHRKYQ QTITDKTVLI HYTGITKPWH SWAGYPSASY FNIAREQSPW KKYPLKEART
VAEMQKQYKH LFAHGEYIKG ITSLIKYKLK K
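
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first row of the gene sequence only. It builds the codon table by hand from the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in the start codons it permits, which this sketch does not handle). The function name `translate` and the constants are illustrative, not taken from the record.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in the record)
    is removed before translation. Alternative start codons are not
    converted to methionine.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGCC"
print(translate(first_row))  # MNEFIKERFSYLADNKKENA
```

The printed residues match the first 20 residues of the protein sequence. Passing the whole gene sequence to the same function would be the natural way to check the remaining rows.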