Gene EcSMS35_3077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3077 
SymboltktA 
ID6144686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3165813 
End bp3167804 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content56% 
IMG OID641617945 
Producttransketolase 
Protein accessionYP_001745096 
Protein GI170681853 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00342437 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCTCAC GTAAAGAGCT TGCCAATGCT ATTCGTGCGC TGAGCATGGA CGCAGTACAG 
AAAGCCAAAT CCGGTCACCC GGGTGCCCCT ATGGGTATGG CTGACATTGC CGAAGTCCTG
TGGCGTGATT TCCTGAAACA CAACCCGCAG AATCCGTCCT GGGCTGACCG TGACCGCTTC
GTGCTGTCCA ACGGCCACGG CTCCATGCTG ATCTACAGCC TGCTGCACCT CACCGGTTAC
GATCTGCCGA TGGAAGAACT GAAAAACTTC CGTCAGCTGC ACTCTAAAAC TCCGGGCCAC
CCGGAAGTGG GTTACACCGC TGGTGTGGAA ACCACTACAG GTCCGCTGGG TCAGGGTATT
GCCAACGCAG TCGGTATGGC GATTGCAGAA AAAACGCTGG CGGCGCAGTT TAACCGTCCG
GGCCACGACA TCGTCGACCA CTACACCTAC GCCTTCATGG GCGACGGCTG CATGATGGAA
GGCATCTCCC ACGAAGTTTG TTCTCTGGCG GGTACGCTGA AGCTGGGTAA ACTGATTGCG
TTCTACGATG ACAACGGTAT CTCTATCGAT GGTCACGTTG AAGGCTGGTT CACCGACGAC
ACCGCAATGC GTTTCGAAGC TTACGGCTGG CACGTTATTC GCGACATCGA CGGTCATGAC
GCGGCATCCA TCAAACGCGC AGTAGAAGAA GCGCGCGCAG TGACTGACAA ACCGTCCCTG
CTGATGTGCA AAACCATCAT CGGTTTCGGT TCCCCGAACA AAGCCGGTAC GCATGACTCC
CACGGTGCGC CGCTGGGCGA TGCTGAAATC GCCCTGACCC GCGAACAGCT GGGCTGGAAA
TATGCACCGT TCGAAATCCC GTCTGAAATC TATGCTCAGT GGGATGCGAA AGAAGCAGGC
CAGGCGAAAG AATCCGCATG GAACGAGAAA TTCGCTGCTT ACGCGAAAGC TTATCCGCAG
GAAGCCGCTG AATTTACCCG CCGTATGAAA GGCGAAATGC CGTCTGACTT CGACGCCAAA
GCGAAAGAAT TCATCGCTAA ACTGCAGGCT AATCCGGCGA AAATCGCCAG TCGTAAAGCG
TCTCAGAATG CGATTGAAGC GTTCGGTCCG CTGTTGCCGG AATTCCTCGG CGGCTCTGCT
GACCTGGCAC CGTCTAACCT GACCCTGTGG TCTGGTTCTA AAGCAATCAA CGAAGATGCT
GCGGGTAACT ACATCCACTA CGGTGTTCGC GAGTTCGGTA TGACCGCGAT TGCTAACGGT
ATCTCCCTGC ACGGTGGTTT CCTGCCGTAC ACCTCCACCT TCCTGATGTT CGTGGAATAC
GCACGTAACG CCGTACGTAT GGCTGCGCTG ATGAAACAGC GTCAGGTGAT GGTTTACACC
CACGACTCCA TCGGTCTGGG CGAAGATGGC CCGACTCACC AGCCGGTTGA GCAGGTCGCT
TCACTGCGCG TGACCCCGAA CATGTCTACA TGGCGTCCAT GTGACCAGGT TGAATCTGCA
GTGGCATGGA AATACGGCGT TGAGCGTCAG GACGGCCCGA CCGCGCTGAT CCTCTCCCGT
CAGAACCTGG CGCAGCAGGA ACGTACTGAA GAGCAGCTGG CAAACATCGC GCGCGGTGGT
TATGTGCTGA AAGACTGTGC CGGTCAGCCG GAACTGATCT TCATCGCGAC CGGTTCGGAA
GTTGAACTGG CTGTTGCCGC CTACGAAAAA CTGACAGCCG AAGGCGTGAA GGCGCGCGTG
GTTTCCATGC CGTCTACCGA CGCGTTCGAC AAGCAGGATG CGGCTTACCG TGAATCCGTA
CTGCCGAAAG CGGTTACTGC ACGCGTTGCG GTAGAAGCGG GTATTGCTGA CTACTGGTAC
AAGTATGTTG GCCTGAACGG CGCTATCGTC GGCATGACCA CCTTCGGTGA ATCTGCTCCG
GCAGAACTGC TGTTTGAAGA GTTCGGCTTC ACCGTTGACA ACGTTGTTGC GAAAGCCAAA
GCGCTGCTGT AA
 
Protein sequence
MSSRKELANA IRALSMDAVQ KAKSGHPGAP MGMADIAEVL WRDFLKHNPQ NPSWADRDRF 
VLSNGHGSML IYSLLHLTGY DLPMEELKNF RQLHSKTPGH PEVGYTAGVE TTTGPLGQGI
ANAVGMAIAE KTLAAQFNRP GHDIVDHYTY AFMGDGCMME GISHEVCSLA GTLKLGKLIA
FYDDNGISID GHVEGWFTDD TAMRFEAYGW HVIRDIDGHD AASIKRAVEE ARAVTDKPSL
LMCKTIIGFG SPNKAGTHDS HGAPLGDAEI ALTREQLGWK YAPFEIPSEI YAQWDAKEAG
QAKESAWNEK FAAYAKAYPQ EAAEFTRRMK GEMPSDFDAK AKEFIAKLQA NPAKIASRKA
SQNAIEAFGP LLPEFLGGSA DLAPSNLTLW SGSKAINEDA AGNYIHYGVR EFGMTAIANG
ISLHGGFLPY TSTFLMFVEY ARNAVRMAAL MKQRQVMVYT HDSIGLGEDG PTHQPVEQVA
SLRVTPNMST WRPCDQVESA VAWKYGVERQ DGPTALILSR QNLAQQERTE EQLANIARGG
YVLKDCAGQP ELIFIATGSE VELAVAAYEK LTAEGVKARV VSMPSTDAFD KQDAAYRESV
LPKAVTARVA VEAGIADYWY KYVGLNGAIV GMTTFGESAP AELLFEEFGF TVDNVVAKAK
ALL