Gene B21_02948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02948 
SymbolkbaZ 
ID8115355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3144314 
End bp3145594 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID644849133 
Producthypothetical protein 
Protein accessionYP_003000706 
Protein GI251786402 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4573] Predicted tagatose 6-phosphate kinase 
TIGRFAM ID[TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACATC TGACAGAAAT GGTGAGACAG CACAAAGCGG GCAAAACAAA TGCAATTTAT 
GCCGTTTGTT CCGCACATCC GCTGGTGCTG GAAGCTGCAA TCCGCTACGC CAGTGCAAAC
CAAACGCCGT TACTGATTGA AGCAACCTCC AATCAGGTAG ACCAGTTCGG CGGTTATACC
GGAATGACGC CCGCCGATTT TCGCGGCTTT GTTTGTCAGC TCGCCGACTC GTTGAATTTC
CCGCAGGATG CGTTGATTCT GGGTGGTGAC CATCTGGGGC CAAACCGCTG GCAAAACCTG
CCAGCCGCTC AGGCAATGGC CAATGCCGAT GATTTGATTA AAAGCTACGT TGCGGCAGGA
TTCAAAAAAA TCCACCTTGA TTGCAGCATG TCCTGTCAGG ACGATCCAAT TCCCTTAACT
GATGACATCG TGGCTGAACG CGCCGCCCGT CTGGCGAAAG TGGCGGAAGA AACCTGTCTT
GAACACTTTG GCGAAGCCGA TCTGGAGTAT GTCATTGGTA CCGAAGTGCC GGTACCTGGC
GGCGCGCATG AAACCTTAAG CGAGCTGGCG GTCACCACGC CGGATGCCGC CCGCGCCACG
CTGGAAGCCC ATCGTCACGC CTTTGAAAAG CAAGGTTTGA ATGCCATCTG GCCACGCATC
ATTGCCCTGG TGGTTCAACC CGGCGTCGAA TTCGATCACA CCAACGTTAT TGATTATCAG
CCCGCCAAAG CGAGCGCCTT AAGCCAGATG GTCGAAAACT ACGAAACGCT GATTTTCGAA
GCGCACTCTA CCGATTATCA AACGCCGCAA TCGCTGCGCC AGCTGGTGAT TGACCACTTT
GCCATTCTGA AAGTTGGCCC AGCGCTGACC TTCGCCCTGC GTGAAGCTCT GTTCTCTCTG
GCGGCGATTG AAGAAGAACT GGTGCCAGCG AAAGCCTGTT CTGGTCTGCG TCAGGTGCTG
GAAGACGTGA TGCTCGACCG CCCGGAATAC TGGCAAAGCC ACTACCACGG TGACGGCAAC
GCGCGTCGTC TGGCGCGTGG TTATAGCTAC TCGGATCGCG TGCGCTATTA CTGGCCGGAC
AGCCAGATTG ATGACGCTTT CGCTCATCTG GTACGTAATC TGGCGGATTC ACCAATTCCG
CTGCCGCTGA TCAGCCAGTA TCTGCCGCTG CAGTACGTGA AAGTTCGCTC CGGCGAGCTG
CAGCCAACGC CACGGGAACT CATTATCAAC CATATTCAGG ACATCCTGGC GCAGTACCAC
ACAGCCTGTG AAGGCCAATA A
 
Protein sequence
MKHLTEMVRQ HKAGKTNAIY AVCSAHPLVL EAAIRYASAN QTPLLIEATS NQVDQFGGYT 
GMTPADFRGF VCQLADSLNF PQDALILGGD HLGPNRWQNL PAAQAMANAD DLIKSYVAAG
FKKIHLDCSM SCQDDPIPLT DDIVAERAAR LAKVAEETCL EHFGEADLEY VIGTEVPVPG
GAHETLSELA VTTPDAARAT LEAHRHAFEK QGLNAIWPRI IALVVQPGVE FDHTNVIDYQ
PAKASALSQM VENYETLIFE AHSTDYQTPQ SLRQLVIDHF AILKVGPALT FALREALFSL
AAIEEELVPA KACSGLRQVL EDVMLDRPEY WQSHYHGDGN ARRLARGYSY SDRVRYYWPD
SQIDDAFAHL VRNLADSPIP LPLISQYLPL QYVKVRSGEL QPTPRELIIN HIQDILAQYH
TACEGQ