Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3344 |
Symbol | hldE |
ID | 6145779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3420937 |
End bp | 3422370 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618173 |
Product | bifunctional heptose 7-phosphate kinase/heptose 1-phosphate adenyltransferase |
Protein accession | YP_001745323 |
Protein GI | 170683743 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2870] ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase |
TIGRFAM ID | [TIGR00125] cytidyltransferase-related domain [TIGR02198] rfaE bifunctional protein, domain I [TIGR02199] rfaE bifunctional protein, domain II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.078646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAA CGCTGCCAGA GTTTGAACGT GCAGGAGTGA TGGTGGTTGG TGATGTGATG CTGGATCGTT ACTGGTACGG TCCCACCAGC CGTATCTCGC CGGAAGCGCC GGTGCCCGTG GTTAAAGTGA ATACCATCGA AGAACGTCCG GGCGGCGCGG CTAACGTGGC GATGAATATC GCTTCTCTCG GTGCTAATGC ACGCCTGGTC GGGTTGACGG GCATTGACGA TGCAGCGCGC GCGCTGAGTA AATCTCTGGC CGACGTCAAC GTCAAATGCG ACTTCGTTTC TGTACCGACG CATCCGACTA TCACCAAATT ACGGGTGCTT TCCCGCAACC AACAGCTGAT TCGTCTGGAT TTTGAAGAAG GTTTCGAAGG TGTTGATCCG CAACCGCTGC ACGAACGGAT TAATCAGGCG CTGAGTTCGA TTGGCGCGCT GGTGCTTTCT GACTACGCCA AAGGTGCGCT GGCAAGCGTA CAGCAGATGA TCCAACTGGC GCGTAAAGCG GGTGTTCCGG TGCTGATTGA TCCAAAAGGT ACCGATTTTG AGCGCTACCG CGGCGCTACG CTGTTAACGC CGAATCTCTC GGAATTTGAA GCTGTTGTCG GTAAATGTAA GACCGAAGAA GAGATTGTTG AGCGCGGCAT GAAACTAATT GCCGATTACG AACTCTCGGC TCTGTTAGTG ACCCGTTCCG AACAGGGTAT GTCGCTGCTG CAACCGGGTA AAGCGCCGCT GCATATGCCA ACCCAGGCGC AGGAAGTGTA TGACGTTACC GGTGCGGGCG ACACGGTGAT TGGCGTCCTG GCGGCAACGC TGGCAGCGGG TAATTCGCTG GAAGAAGCCT GCTTCTTTGC CAATGCGGCG GCTGGTGTGG TGGTCGGCAA ACTGGGGACA TCCACGGTTT CGCCGATCGA GCTGGAAAAC GCAGTACGTG GACGTGCAGA TACCGGCTTT GGCGTGATGA CCGAAGAGGA ACTGAAGCTT GCTGTAGCGG CAGCGCGTAA ACGCGGTGAA AAAGTGGTGA TGACCAATGG CGTCTTTGAC ATCCTGCACG CCGGACACGT CTCTTATCTG GCAAATGCCC GCAAACTGGG TGACCGTTTG ATTGTCGCCG TCAACAGCGA TGCCTCCACC AAACGGCTGA AAGGGGATTC CCGCCCCGTT AACCCGCTTG AACAGCGTAT GATTGTGCTG GGCGCACTGG AAGCGGTCGA CTGGGTGGTG TCGTTTGAAG AAGACACGCC GCAGCGCTTG ATCGCCGGGA TCCTGCCAGA CCTGCTGGTG AAAGGCGGCG ATTATAAACC AGAAGAGATT GCCGGGAGTA AAGAAGTCTG GGCCAATGGT GGCGAAGTGC TGGTGCTCAA CTTTGAAGAC GGTTGCTCGA CCACTAACAT TATCAAGAAG ATCCAACAGG ATAAAAAAGG CTAA
|
Protein sequence | MKVTLPEFER AGVMVVGDVM LDRYWYGPTS RISPEAPVPV VKVNTIEERP GGAANVAMNI ASLGANARLV GLTGIDDAAR ALSKSLADVN VKCDFVSVPT HPTITKLRVL SRNQQLIRLD FEEGFEGVDP QPLHERINQA LSSIGALVLS DYAKGALASV QQMIQLARKA GVPVLIDPKG TDFERYRGAT LLTPNLSEFE AVVGKCKTEE EIVERGMKLI ADYELSALLV TRSEQGMSLL QPGKAPLHMP TQAQEVYDVT GAGDTVIGVL AATLAAGNSL EEACFFANAA AGVVVGKLGT STVSPIELEN AVRGRADTGF GVMTEEELKL AVAAARKRGE KVVMTNGVFD ILHAGHVSYL ANARKLGDRL IVAVNSDAST KRLKGDSRPV NPLEQRMIVL GALEAVDWVV SFEEDTPQRL IAGILPDLLV KGGDYKPEEI AGSKEVWANG GEVLVLNFED GCSTTNIIKK IQQDKKG
|
| |