Gene B21_03441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03441 
SymbolrfaQ 
ID8115641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3671935 
End bp3672957 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content47% 
IMG OID644849614 
Producthypothetical protein 
Protein accessionYP_003001187 
Protein GI251786883 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02201] lipopolysaccharide heptosyltransferase III, putative 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.331205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTC ATGGGGATAT GTTATTAACG ACTCCCGTCA TTAGTTCGCT GAAAAAAAAT 
TACCCTGACG CAAAAATCGA TGTGCTGCTT TATCAGGACA CCATCCCGAT CCTGTCTGAA
AATCCAGAGA TTAACGCGCT CTACGGCATA AAAAATAAAA AAGCAAAAGC CTCAGAAAAA
ATTGCCAACT TTTTTCATCT CATCAAGGTA TTACGTGCCA ATAAGTATGA CCTTATCGTC
AATCTTACCG ATCAATGGAT GGTTGCTATA CTGGTTCGCT TATTAAATGC CCGTGTGAAA
ATTTCCCAGG ATTATCATCA TCGGCAGTCT GCTTTTTGGC GTAAAAGTTT CACCCATTTG
GTGCCGTTGC AGGGTGGAAA TGTGGTGGAA AGTAACTTAT CCGTGCTGAC ACCATTGGGA
CTTGATTCGT TGGTGAAGCA GACAACCATG AGTTACCCGC CTGCAAGCTG GAAACGTATG
CGTCGCGAAC TTGATCACGC TGGTGTTGGA CAAAATTATG TGGTTATCCA ACCTACGGCG
CGGCAAATCT TCAAATGCTG GGACAACGCC AAGTTTTCCG CTGTGATTGA TGCCTTACAT
GCTCGTGGTT ATGAAGTTGT TCTGACGTCC GGCCCGGATA AAGACGATCT GGCCTGCGTC
AATGAAATTG CGCAGGGATG CCAGACGCCA CCAGTAACGG CGCTGGCTGG AAAGGTGACC
TTCCCGGAAC TTGGTGCGTT AATCGATCAT GCGCAGCTGT TTATTGGCGT TGATTCCGCA
CCGGCGCATA TTGCCGCTGC AGTTAATACG CCGCTGATAT CGCTGTTTGG CGCAACAGAT
CATATTTTCT GGCGTCCCTG GTCAAATAAC ATGATTCAAT TCTGGGCGGG AGATTACCGG
GAAATGCCAA CGCGCGATCA GCGTGACCGA AATGAGATGT ATCTTTCGGC TATTCCGGCG
GCAGATGTCA TTGCTGCTGT CGATAAATTA CTGCCCTCCT CCACGACAGG TACGTCGTTA
TGA
 
Protein sequence
MRFHGDMLLT TPVISSLKKN YPDAKIDVLL YQDTIPILSE NPEINALYGI KNKKAKASEK 
IANFFHLIKV LRANKYDLIV NLTDQWMVAI LVRLLNARVK ISQDYHHRQS AFWRKSFTHL
VPLQGGNVVE SNLSVLTPLG LDSLVKQTTM SYPPASWKRM RRELDHAGVG QNYVVIQPTA
RQIFKCWDNA KFSAVIDALH ARGYEVVLTS GPDKDDLACV NEIAQGCQTP PVTALAGKVT
FPELGALIDH AQLFIGVDSA PAHIAAAVNT PLISLFGATD HIFWRPWSNN MIQFWAGDYR
EMPTRDQRDR NEMYLSAIPA ADVIAAVDKL LPSSTTGTSL