Gene ECD_02765 details


Gene Information

Locus tag: ECD_02765
Symbol: tktA
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2910502
End bp: 2912493
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: transketolase 1, thiamin-binding
Protein accession: ACT44582
Protein GI: 253978912
COG category:
COG ID:
TIGRFAM ID:
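
The coordinates and lengths listed above appear to be mutually consistent. The short check below is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated on this page. The variable names are illustrative.

```python
# Values copied from the gene record.
start_bp = 2910502
end_bp = 2912493

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values match the listed Gene Length (1992 bp) and Protein Length (663 aa).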


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG 
AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG
TGGCGTGATT TCCTGAAACA CAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC
GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC
GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGTCAC
CCGGAAGTGG GTTACACCGC TGGTGTGGAA ACCACCACCG GTCCGCTGGG TCAGGGTATT
GCCAACGCAG TCGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCG
GGCCACGACA TTGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA
GGCATCTCCC ACGAAGTTTG CTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCA
TTCTACGATG ACAACGGTAT TTCTATCGAT GGTCACGTTG AAGGCTGGTT CACCGACGAC
ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC
GCGGCATCTA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCTTCCCTG
CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAATA AAGCCGGTAC CCACGACTCC
CACGGTGCGC CGCTGGGCGA CGCTGAAATT GCCCTGACCC GCGAACAACT GGGCTGGAAA
TATGCGCCGT TCGAAATCCC GTCTGAAATC TATGCTCAGT GGGATGCGAA AGAAGCAGGC
CAGGCGAAAG AATCCGCATG GAACGAGAAA TTCGCTGCTT ACGCGAAAGC TTATCCGCAG
GAAGCCGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCTAAA
GCGAAAGAGT TCATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG CCGTAAAGCG
TCTCAGAATG CTATCGAAGC GTTCGGTCCG CTGTTGCCGG AATTCCTCGG CGGTTCTGCT
GACCTGGCGC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT
GCGGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT
ATCTCCCTGC ACGGTGGCTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT CGTGGAATAC
GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTGAT GGTTTACACC
CACGACTCCA TCGGTCTGGG CGAAGACGGC CCGACTCACC AGCCGGTTGA GCAGGTCGCT
TCTCTGCGCG TAACCCCGAA CATGTCTACA TGGCGTCCGT GTGACCAGGT TGAATCCGCG
GTCGCGTGGA AATACGGTGT TGAGCGTCAG GACGGCCCGA CCGCACTGAT CCTCTCCCGT
CAGAACCTGG CGCAGCAGGA ACGAACTGAA GAGCAACTGG CAAACATCGC GCGCGGTGGT
TATGTGCTGA AAGACTGCGC CGGTCAGCCG GAACTGATTT TCATCGCTAC CGGTTCAGAA
GTTGAACTGG CTGTTGCTGC CTACGAAAAA CTGACTGCCG AAGGCGTGAA AGCGCGCGTG
GTGTCCATGC CGTCTACCGA CGCATTTGAC AAGCAGGATG CTGCTTACCG TGAATCCGTA
CTGCCGAAAG CGGTTACTGC ACGCGTTGCT GTAGAAGCGG GTATTGCTGA CTACTGGTAC
AAGTATGTTG GCCTGAACGG TGCTATCGTC GGTATGACCA CCTTCGGTGA ATCTGCTCCG
GCAGAGCTGC TGTTTGAAGA GTTCGGCTTC ACTGTTGATA ACGTTGTTGC GAAAGCAAAA
GAACTGCTGT AA
 
Protein sequence
MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF 
VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI
ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA
FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL
LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG
QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA
SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG
ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA
SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG
YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV
LPKAVTARVA VEAGIADYWY KYVGLNGAIV GMTTFGESAP AELLFEEFGF TVDNVVAKAK
ELL
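
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It relies on NCBI translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical and introduced only for this example.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues on the page."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied as shown on the page.
first_codons = "ATGTCCTCAC GTAAAGAGCT TGCCAATGCT"
last_codons = "GAACTGCTGT AA"

print(translate(first_codons))  # MSSRKELANA
print(translate(last_codons))   # ELL
```

The two fragments translate to the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be checked in the same way.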