Gene EcE24377A_4130 details

Gene Information

Locus tag: EcE24377A_4130
Symbol: rfaI
ID: 5586429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4120062
End bp: 4121078
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 38%
IMG OID: 640927749
Product: lipopolysaccharide 1,3-galactosyltransferase
Protein accession: YP_001465109
Protein GI: 157159286
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
TIGRFAM ID:
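
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4120062
end_bp = 4121078

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1017 bp
print(protein_length, "aa")  # 338 aa
```

Under these assumptions the computed values agree with the listed Gene Length (1017 bp) and Protein Length (338 aa).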


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00481358
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCC ACTATTTTAA TCCACAAGAG ATGATCAATA AGACAATCAT CTTCGATGAA 
AGGCCAGCGG CGTCAGTGGC ATCATCATTC CATGTTGCTT ATGGCATTGA TAAAAACTTT
CTTTTTGGTT GTGGTGTTTC AATCACGTCA GTTTTGTTAC ATAACAACGA CGTGAGTTTT
GTTTTCCACG TTTTTATTGA TGATATCCCT GAAGCCGATA TCCAGCGTTT AGCCCAATTG
GCGAAAAGCT ATCGTACCTG TATCCAGATC CATCTAGTAA ATTGTGAACG GCTTAAGGCA
TTACCGACGA CCAAAAATTG GTCTATTGCC ATGTATTTCC GTTTTGTAAT TGCAGATTAC
TTTATTGATC AACAAGATAA GATCCTTTAC CTGGATGCTG ATATCGCCTG TCAGGGAAAC
TTAAAGCCGC TGATAACAAT GGATCTTGCC AATAACGTTG CTGCTGTTGT TACTGAACGC
GATGCTAACT GGTGGTCGTT ACGGGGTCAA AGTCTGCAGT GTAATGAACT TGAAAAGGGT
TACTTTAATT CAGGTGTCCT GTTAATTAAT ACACTAGCGT GGGCGCAGGA GTCCGTTTCT
GCTAAAGCGA TGTCGATGCT TGCTGATAAA GCCATCGTTT CCCGTTTAAC CTATATGGAT
CAAGATATCC TTAATCTTAT CCTGTTAGGG AAAGTTAAAT TCATTGATGC TAAATACAAT
ACGCAATTTA GTTTAAATTA TGAATTAAAA AAATCATTTG TTTGTCCAAT TAATGATGAA
ACCGTATTAA TTCATTATGT CGGCCCGACA AAACCCTGGC ATTACTGGGC CGGTTATCCA
AGTGCGCAAC CTTTTATCAA AGCCAAAGAA GCATCGCCCT GGAAAAATGA ACCGTTAATG
CGGCCAGTTA ACTCAAACTA TGCTCGTTAT TGCGCCAAGC ATAATTTTAA ACAAAACAAA
CCAATTAACG GGATAATGAA TTATATTTAT TATTTTTATT TAAAGATAAT AAAATGA
 
Protein sequence
MSAHYFNPQE MINKTIIFDE RPAASVASSF HVAYGIDKNF LFGCGVSITS VLLHNNDVSF 
VFHVFIDDIP EADIQRLAQL AKSYRTCIQI HLVNCERLKA LPTTKNWSIA MYFRFVIADY
FIDQQDKILY LDADIACQGN LKPLITMDLA NNVAAVVTER DANWWSLRGQ SLQCNELEKG
YFNSGVLLIN TLAWAQESVS AKAMSMLADK AIVSRLTYMD QDILNLILLG KVKFIDAKYN
TQFSLNYELK KSFVCPINDE TVLIHYVGPT KPWHYWAGYP SAQPFIKAKE ASPWKNEPLM
RPVNSNYARY CAKHNFKQNK PINGIMNYIY YFYLKIIK
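
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal, hand-rolled translator, not the tool used to produce this record. It assumes that translation table 11 uses the same codon-to-amino-acid mapping as the standard code (the two differ only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table laid out in the conventional T, C, A, G base order.
# Assumption: table 11 assigns amino acids exactly as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used on the page
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = "ATGAGTGCCC ACTATTTTAA TCCACAAGAG"
print(translate(first_block))  # MSAHYFNPQE
```

The output matches the first ten residues of the protein sequence, and the same function applied to the last line of the gene sequence stops at the terminal TGA codon.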