Gene EcE24377A_4133 details


Gene Information

Locus tag: EcE24377A_4133
Symbol: rfaQ
ID: 5590050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4123005
End bp: 4124027
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 47%
IMG OID: 640927752
Product: lipopolysaccharide core biosynthesis protein
Protein accession: YP_001465112
Protein GI: 157157104
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02201] lipopolysaccharide heptosyltransferase III, putative
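
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length is consistent with that reading) and that the protein length excludes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(4123005, 4124027)
print(length)                  # 1023
print(protein_length(length))  # 340
```

Both results agree with the Gene Length and Protein Length fields of the record.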


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000187228
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTTTC ATGGGGATAT GTTATTAACT ACTCCCGTCA TTAGTTCGCT GAAAAAAAAT 
TACCCTGACG CAAAAATCGA TGTGCTGCTT TATCAGGACA CCATCCCGAT CCTGTCTGAA
AATCCAGAGA TTAACGCGCT CTACGGCATA AAAAATAAAA AAGCAAAAGC CTCAGAAAAA
ATTGCCAACT TTTTTCATCT CATCAAGGTA TTACGTGCCA ATAAGTATGA CCTTATCGTC
AATCTTACCG ATCAATGGAT GGTTGCTATA CTGGTTCGCT TATTAAATGC TCGTGTGAAA
ATTTCCCAGG ATTATCATCA TCGGCAGTCT GCTTTTTGGC GTAAAAGTTT CACCCATTTG
GTGCCGTTGC AGGGTGGAAA TGTGGTGGAA AGTAACTTAT CCGTGCTGAC ACCATTGGGA
CTTGATTCGT TGGTGAAGCA GACAACCATG AGTTACCCGC CTGCAAGCTG GAAACGTATG
CGTCGCGAAC TTGATCACGC TGGTGTTGGA CAAAATTATG TGGTTATCCA ACCTACGGCG
CGGCAAATCT TCAAATGCTG GGACAACGCC AAGTTTTCCG CTGTGATTGA TGCCTTACAT
GCTCGTGGTT ATGAAGTCGT TCTGACGTCC GGCCCAGATA AAGACGATCT GGCCTGCGTC
AATGAAATTG CGCAGGGATG CCAGACGCCA CCAGTAACGG CGCTGGCTGG AAAGGTGACC
TTCCCGGAAC TTGGTGCGTT AATCGATCAT GCGCAGCTGT TTATTGGCGT TGATTCCGCA
CCGGCGCATA TTGCCGCTGC AGTTAATACG CCGCTGATAT CGCTGTTTGG CGCAACAGAT
CATATTTTCT GGCGTCCCTG GTCAAATAAC ATGATTCAAT TCTGGGCGGG AGATTACCGG
GAAATGCCAA CGCGCGATCA GCGTGACCGA AATGAGATGT ATCTTTCGGT TATTCCGGCG
GCAGATGTCA TTGCTGCTGT CGATAAATTA CTGCCCTCCT CCACGACAGG TACGTCGTTA
TGA
 
Protein sequence
MRFHGDMLLT TPVISSLKKN YPDAKIDVLL YQDTIPILSE NPEINALYGI KNKKAKASEK 
IANFFHLIKV LRANKYDLIV NLTDQWMVAI LVRLLNARVK ISQDYHHRQS AFWRKSFTHL
VPLQGGNVVE SNLSVLTPLG LDSLVKQTTM SYPPASWKRM RRELDHAGVG QNYVVIQPTA
RQIFKCWDNA KFSAVIDALH ARGYEVVLTS GPDKDDLACV NEIAQGCQTP PVTALAGKVT
FPELGALIDH AQLFIGVDSA PAHIAAAVNT PLISLFGATD HIFWRPWSNN MIQFWAGDYR
EMPTRDQRDR NEMYLSVIPA ADVIAAVDKL LPSSTTGTSL
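
The sequences above are printed in blocks of 10 residues. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence. It is an illustration rather than the method used to generate this record. It assumes the sequence contains only A, C, G and T, and it relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons). All function and variable names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Amino acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGCGTTTTC ATGGGGATAT GTTATTAACT"
print(translate(first_blocks))  # MRFHGDMLLT
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to gc_fraction and translate gives values that can be compared against the GC content and Protein sequence fields.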