Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3429 |
Symbol | |
ID | 6144342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3508750 |
End bp | 3510030 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618258 |
Product | D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit |
Protein accession | YP_001745407 |
Protein GI | 170680844 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4573] Predicted tagatose 6-phosphate kinase |
TIGRFAM ID | [TIGR02810] D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACATC TGACAGAAAT GGTGAGACAG CACAAAGAGG GCAAAACAAA TGGGATTTAT GCCGTTTGTT CCGCACATCC GCTGGTGCTG GAAGCTGCAA TCCGCTACGC CAGTGCAAAC CAAACGCCGT TACTGATTGA AGCAACCTCC AATCAGGTGG ACCAGTTCGG CGGTTATACC GGAATGACGC CCGCCGATTT TCGCGGCTTT GTTTGTCAGC TCGCCGACTC GTTGAATTTC CCACAGGATG CGTTGATTCT GGGTGGCGAC CATCTGGGGC CAAACCGCTG GCAAAACCTG CCTGCCGCTC AGGCAATGGC CAATGCCGAT GATTTGATTA AAAGCTACGT GGCGGCAGGA TTCAAAAAAA TTCACCTTGA TTGCAGCATG TCCTGTCAGG ACGATCCGAT TCCCTTAACT GATGACATCG TGGCCGAACG CGCCGCTCGT CTGGCGAAAG TGGCGGAAGA AACCTGCCGT GAACATTTTG GCGCGGCCGA TCTGGAGTAT GTCATTGGTA CCGAAGTACC GGTACCTGGC GGCGCGCATG AAACCTTAAG CGAGCTGGCG GTCACCACGC CGGATGCCGC CCGCGCCACG CTGGAAGCCC ATCGTCACGC CTTTGAAAAG CAAGGTCTGA GTGCCATCTG GCCACGCATT ATTGCCCTGG TGGTTCAACC CGGCGTCGAA TTCGATCACA CCAACGTTAT TGATTATCAG CCCGTCAAAG CGACCGCCTT AAGCCAGATG GTCGAAAACT ACGAAACGCT GATTTTCGAA GCGCACTCTA CTGATTACCA AACGCCGCAA TCGCTGCGCC AGCTGGTGAT TGACCACTTT GCCATTCTGA AAGTTGGCCC GGCGCTGACC TTTGCTCTGC GTGAAGCTCT GTTCTCTCTG GCGGCGATTG AAGAAGAACT GGTACCAGCA AAAGCCTGTT CTGGTCTGCG TCAGGTGCTG GAAAACGTGA TGCTCGACCG CCCGGAATAC TGGCAAAGCC ACTACCACGG TGACGGCAAC GCGCGTCGTC TGGCGCGTGG TTATAGCTAC TCGGATCGCG TGCGCTATTA CTGGCCGGAC AGCCAGATTG ATGACGCTTT CGCTCATCTG GTTCGTAATC TGGCGGATTC ACCAATTCCG CTACCGCTGA TCAGCCAGTA TTTGCCGCTG CAATACGTGA AAGTTCGCTC CGGCGAGCTG CAGCCAACAC CACGGGAACT CATTATCAAC CATATTCAGG ACATCCTGGC GCAGTACCAC ACAGCCTGTG AAGGCCAATA A
|
Protein sequence | MKHLTEMVRQ HKEGKTNGIY AVCSAHPLVL EAAIRYASAN QTPLLIEATS NQVDQFGGYT GMTPADFRGF VCQLADSLNF PQDALILGGD HLGPNRWQNL PAAQAMANAD DLIKSYVAAG FKKIHLDCSM SCQDDPIPLT DDIVAERAAR LAKVAEETCR EHFGAADLEY VIGTEVPVPG GAHETLSELA VTTPDAARAT LEAHRHAFEK QGLSAIWPRI IALVVQPGVE FDHTNVIDYQ PVKATALSQM VENYETLIFE AHSTDYQTPQ SLRQLVIDHF AILKVGPALT FALREALFSL AAIEEELVPA KACSGLRQVL ENVMLDRPEY WQSHYHGDGN ARRLARGYSY SDRVRYYWPD SQIDDAFAHL VRNLADSPIP LPLISQYLPL QYVKVRSGEL QPTPRELIIN HIQDILAQYH TACEGQ
|
| |