Gene B21_03996 details

Gene Information

Locus tag: B21_03996
Symbol: yjeF
ID: 8115780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4298676
End bp: 4300223
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 57%
IMG OID: 644850148
Product: hypothetical protein
Protein accession: YP_003001721
Protein GI: 251787417
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
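
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and used for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4298676, 4300223)
print(length)                  # 1548
print(protein_length(length))  # 515
```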


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000718963
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGTACGCC 
GACGATATCC GCCGCGGAGA ACGCGAGGCG GCAGATGCGC TGGGGCTCAC ACTCTATGAG
CTGATGCTTC GCGCTGGCGA AGCAGCATTC CAGGTGTGTC GTTCGGCTTA TCCTGACGCC
CGCCACTGGC TGGTGCTGTG CGGTCATGGT AATAACGGCG GCGATGGCTA TGTGGTCGCG
CGGCTGGCCA CAGCGGTCGG TATTGAGGTC ACGCTGCTGG CCCAGGAAAG TGACAAACCG
TTGCCGGAAG AGGCCGCGCT GGCACGCGAA GCATGGTTAA ACGCGGGTGG CGAGATCCAT
GCTTCGAATA TTGTCTGGCC CGAATCGGTA GATCTGATTG TTGATGCGCT GCTCGGTACC
GGCTTGCAGC AAGCGCCCCG CGAATCCATT AGCCAGTTAA TCGACCACGC TAATTCCCAT
CCTGCGCCGA TTGCGGCGGT TGATATCCCT TCCGGCCTGC TGGCTGAAAC CGGCGCTACG
CCAGGCGCAG TGATCAACGC CGATCACACC ATCACTTTTA TTGCGCTGAA ACCAGGCTTG
CTCACTGGAA AAGCGCGGGA TGTTACAGGA CAACTGCATT TTGACTCACT GGGGCTGGAT
AGCTGGCTGG CAGGTCAGGA GACGAAAATT CAGCGGTTTT CGGCAGAACA ACTTTCTCAC
TGGCTGAAAC CGCGTCGCCC GACTTCGCAT AAAGGCGATC ACGGGCGGCT GGTGATTATC
GGTGGCGATC ACGGCACTGC GGGGGCTATT CGTATGACGG GGGAAGCGGC GCTACGTGCT
GGTGCTGGTT TAGTCCGAGT ACTGACCCGC AGTGAAAACA TTGCGCCGCT GCTGACTGCA
CGACCAGAAT TGATGGTGCA TGAACTGACG ATGGACTCTC TTGCCGAAAG CCTGGAATGG
GCCGATGTGG TGGTGATTGG TCCCGGTCTG GGCCAGCAAG AGTGGGGGAA AAAAGCCCTG
CAAAAAGTTG AGAATTTTCG CAAACCGATG TTGTGGGATG CCGATGCATT GAACCTGCTG
GCAATCAATC CCGATAAGCG TCACAATCGC GTGATCACGC CGCATCCTGG CGAGGCCGCA
CGGTTGTTAG GCTGTTCCGT CGCTGAAATT GAAAGTGACC GCTTACATTG CGCCAAACGT
CTGGTACAAC GTTATGGCGG CGTAGCGGTG CTGAAAGGTG CCGGAACCGT GGTCGCCGCC
CATCCTGACG CTTTAGGCAT TATTGATGTC GGAAATGCAG GCATGGCGAG CGGCGGCATG
GGCGATGTGC TCTCTGGTAT TATTGGCGCA TTGCTTGGGC AAAAACTGTC GCCGTATGAT
GCAGCCTGTG CAGGCTGTGT TGCGCACGGT GCGGCAGCTG ACGTACTGGC GGCGCGTTTT
GGAACGCGCG GGATGCTGGC AACCGATCTC TTTTCCACGC TACAGCGTAT TGTTAACCCG
GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATCCCG CTCCCTGA
 
Protein sequence
MTDHTMKKNP VSIPHTVWYA DDIRRGEREA ADALGLTLYE LMLRAGEAAF QVCRSAYPDA 
RHWLVLCGHG NNGGDGYVVA RLATAVGIEV TLLAQESDKP LPEEAALARE AWLNAGGEIH
ASNIVWPESV DLIVDALLGT GLQQAPRESI SQLIDHANSH PAPIAAVDIP SGLLAETGAT
PGAVINADHT ITFIALKPGL LTGKARDVTG QLHFDSLGLD SWLAGQETKI QRFSAEQLSH
WLKPRRPTSH KGDHGRLVII GGDHGTAGAI RMTGEAALRA GAGLVRVLTR SENIAPLLTA
RPELMVHELT MDSLAESLEW ADVVVIGPGL GQQEWGKKAL QKVENFRKPM LWDADALNLL
AINPDKRHNR VITPHPGEAA RLLGCSVAEI ESDRLHCAKR LVQRYGGVAV LKGAGTVVAA
HPDALGIIDV GNAGMASGGM GDVLSGIIGA LLGQKLSPYD AACAGCVAHG AAADVLAARF
GTRGMLATDL FSTLQRIVNP EVTDKNHDES SNPAP
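
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is a minimal Python illustration, not the tool that produced this record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares for codon-to-residue assignments. Alternative start codons, which table 11 permits, are not handled here; this gene starts with ATG.

```python
from itertools import product

# Standard NCBI codon order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above (60 nt, 20 codons).
first_line = "ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGTACGCC"
print(translate(first_line))  # MTDHTMKKNPVSIPHTVWYA
```

Passing the full gene sequence to `gc_fraction` and `translate` allows the GC content and protein sequence in this record to be checked in the same way.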