Gene EcHS_A3837 details


Gene Information

Locus tag: EcHS_A3837
Symbol: rfaJ2
ID: 5592840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3832695
End bp: 3833690
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 35%
IMG OID: 640922949
Product: lipopolysaccharide 1,2-glucosyltransferase
Protein accession: YP_001460427
Protein GI: 157163109
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
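
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3832695, 3833690)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```

Both values match the Gene Length and Protein Length fields of this record.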


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0000031402
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGAAT TTATAAAAGA ACGGTTTTCG TATTTAGCAG ATAATAAAAA AGAAAACGCC 
CCAGAGCTAA ATGTTTCCTA CGGTATCGAT AAGAATTTTT TGTATGGTGC TGGCGTTTCA
ATTTCTTCCG TTTTGATTAA TAATTCAGAT ATTAATTTTG TCTTTCATGT TTTCACTGAT
TATGTGGATG ATAATTATTT AAAGTCATTT AATGAAACAG CAAAACAATT TAATACCTCA
ATTATTGTAT ATTTAATTGA CCCCAAATAC TTTGCTGATC TGCCGACGTC ACAGTTTTGG
TCGTACGCGA CATACTTCAG GGTATTGTCT TTTGAATATC TGAGTGAAAG TATTTCCACA
CTGCTGTATC TGGATGCCGA TGTTGTTTGT AAAGGAAGCC TGAAACCTCT CACAGAAATT
ATATTTAAAG ATGAGTTTGC TGCGGTCATT CCTGACAATG ATAGTACTCA GGCGGCATGT
GCAAAACGCC TCAACATTCC CGAAATGAAT GGACGTTATT TCAATGCAGG CGTTATCTAT
GTCAATCTTA AAAAATGGCA TGAAGCAAAT TTGACACCGT ATTTACTCAC GCTTTTACGA
GGGGAAACTA AATATGGCTC TCTTAAATAT TTAGATCAGG ATGCGTTGAA TATCGCATTT
AATATGAATA ATATCTACCT CGCGAAGGAT TTTGATACTA TTTATACCCT GAAAAACGAA
CTTCATGATC GTAGTCATCG AAAGTATCAG CAAACCATTA CTGATAAAAC AGTATTGATT
CACTATACAG GGATAACTAA ACCATGGCAT AGCTGGGCTG GATATCCGTC TGCATCATAC
TTTAATATCG CGCGTGAACA ATCTCCCTGG AAGAAATATC CTCTTAAAGA GGCGCGGACT
GTTGCAGAAA TGCAGAAACA ATATAAGCAT CTGTTTGCCC ATGGTGAGTA TATTAAAGGT
ATAACTTCAT TAATTAAGTA CAAGCTTAAG AAATAA
 
Protein sequence
MNEFIKERFS YLADNKKENA PELNVSYGID KNFLYGAGVS ISSVLINNSD INFVFHVFTD 
YVDDNYLKSF NETAKQFNTS IIVYLIDPKY FADLPTSQFW SYATYFRVLS FEYLSESIST
LLYLDADVVC KGSLKPLTEI IFKDEFAAVI PDNDSTQAAC AKRLNIPEMN GRYFNAGVIY
VNLKKWHEAN LTPYLLTLLR GETKYGSLKY LDQDALNIAF NMNNIYLAKD FDTIYTLKNE
LHDRSHRKYQ QTITDKTVLI HYTGITKPWH SWAGYPSASY FNIAREQSPW KKYPLKEART
VAEMQKQYKH LFAHGEYIKG ITSLIKYKLK K
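
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal Python sketch. It builds a codon table in the usual NCBI base order (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which this sketch does not model. The helper names `translate` and `gc_content` are illustrative assumptions, and whitespace in the input is ignored so the 10-base blocks shown above can be pasted directly.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAATGAAT TTATAAAAGA ACGGTTTTCG"))  # MNEFIKERFS
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.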