Gene EcHS_A3838 details


Gene Information

Locus tag: EcHS_A3838
Symbol: rfaI
ID: 5593296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3833707
End bp: 3834723
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 38%
IMG OID: 640922950
Product: lipopolysaccharide 1,3-galactosyltransferase
Protein accession: YP_001460428
Protein GI: 157163110
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
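
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3833707
END_BP = 3834723


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1017
print(protein_length(length_bp))  # 338
```

Both values agree with the Gene Length and Protein Length fields in the record.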


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.00000851053
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCAATA AGACAATCAT CTTCGATGAA 
AGGCCAGCGG CGTCAGTGGC ATCATCATTC CATGTTGCTT ATGGCATTGA TAAAAACTTT
CTTTTTGGTT GTGGTGTTTC AATCACGTCA GTTTTGTTAC ATAACAACGA CGTGAGTTTT
GTTTTCCACG TTTTTATTGA TGATATCCCT GAAGCCGATA TCCAGCGTTT AGCCCAATTG
GCGAAAAGCT ATCGTACCTG TATCCAGATC CATCTGGTAA ATTGTGAACG GCTTAAGGCA
TTACCGACGA CCAAAAATTG GTCTATTGCC ATGTATTTCC GTTTTGTAAT TGCAGATTAC
TTTATTGATC AACAAGATAA GATCCTTTAC CTGGATGCTG ATATCGCCTG TCAGGGAAAC
TTAAAGCCGC TGATAACAAT GGATCTTGCC AATAACGTTG CTGCTGTTGT TACTGAACGC
GATGCTAACT GGTGGTCGTT ACGGGGTCAA AGTCTGCAGT GTAATGAACT TGAAAAGGGT
TACTTTAATT CAGGTGTCCT GTTAATTAAT ACACTAGCGT GGGCGCAGGA GTCCGTTTCT
GCTAAAGCGA TGTCGATGCT TGCTGATAAA GCCATCGTTT CCCATTTAAC CTATATGGAT
CAAGATATCC TTAATCTTAT CCTGTTAGGG AAAGTTAAAT TCATTGATGC TAAATACAAT
ACGCAGTTTA GTTTAAATTA TGAATTAAAA AAATCATTTG TTTGTCCAAT TAATGATGAA
ATCGTATTAA TTCATTATGT CGGCCCGACA AAACCCTGGC ATTACTGGGC CAGTTATCCA
AGTGCGCAAC CATTTATCAA AGCCAAAGAA GCATCGCCCT GGAAAAATGA ACCGTTAATG
CGGCCAGTTA ACTCAAACTA TGCTCGTTAT TGCGCCAAGC ATAATTTTAA ACAAAATAAA
CCAATTAACG GGATAATGAA TTATATTTAT TATTTTTATT TAAAGATAAT AAAATGA
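
The GC content field can be recomputed from the gene sequence. This is a minimal sketch that assumes GC content means the percentage of G and C bases among all bases, with the spaces and line breaks of the listing ignored. `gc_percent` is a hypothetical helper, and how the source database rounds its figure is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Paste the full gene sequence block here; the 10-base grouping is harmless.
first_line = "ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCAATA AGACAATCAT CTTCGATGAA"
print(round(gc_percent(first_line), 1))
```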
 
Protein sequence
MSAHYFNPQE MINKTIIFDE RPAASVASSF HVAYGIDKNF LFGCGVSITS VLLHNNDVSF 
VFHVFIDDIP EADIQRLAQL AKSYRTCIQI HLVNCERLKA LPTTKNWSIA MYFRFVIADY
FIDQQDKILY LDADIACQGN LKPLITMDLA NNVAAVVTER DANWWSLRGQ SLQCNELEKG
YFNSGVLLIN TLAWAQESVS AKAMSMLADK AIVSHLTYMD QDILNLILLG KVKFIDAKYN
TQFSLNYELK KSFVCPINDE IVLIHYVGPT KPWHYWASYP SAQPFIKAKE ASPWKNEPLM
RPVNSNYARY CAKHNFKQNK PINGIMNYIY YFYLKIIK
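
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may start translation; that does not matter here because the gene starts with ATG. The `translate` helper is illustrative only.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGTGCCC ACTATTTTAA TCCACAAGAG"))  # MSAHYFNPQE
```

The output matches the first ten residues of the protein sequence in the record.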