Gene EcSMS35_3344 details

Gene Information

Locus tag: EcSMS35_3344
Symbol: hldE
ID: 6145779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3420937
End bp: 3422370
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 55%
IMG OID: 641618173
Product: bifunctional heptose 7-phosphate kinase/heptose 1-phosphate adenyltransferase
Protein accession: YP_001745323
Protein GI: 170683743
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2870] ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase
TIGRFAM ID: [TIGR00125] cytidyltransferase-related domain
    [TIGR02198] rfaE bifunctional protein, domain I
    [TIGR02199] rfaE bifunctional protein, domain II
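
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3420937
end_bp = 3422370
gene_length_bp = 1434
protein_length_aa = 477

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop
# codon, which does not contribute an amino acid.
codons = span // 3
expected_protein_length = codons - 1

print("span (bp):", span)
print("remainder after dividing by 3:", span % 3)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions the span matches the listed gene length and the codon count minus one matches the listed protein length.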


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.078646
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTAA CGCTGCCAGA GTTTGAACGT GCAGGAGTGA TGGTGGTTGG TGATGTGATG 
CTGGATCGTT ACTGGTACGG TCCCACCAGC CGTATCTCGC CGGAAGCGCC GGTGCCCGTG
GTTAAAGTGA ATACCATCGA AGAACGTCCG GGCGGCGCGG CTAACGTGGC GATGAATATC
GCTTCTCTCG GTGCTAATGC ACGCCTGGTC GGGTTGACGG GCATTGACGA TGCAGCGCGC
GCGCTGAGTA AATCTCTGGC CGACGTCAAC GTCAAATGCG ACTTCGTTTC TGTACCGACG
CATCCGACTA TCACCAAATT ACGGGTGCTT TCCCGCAACC AACAGCTGAT TCGTCTGGAT
TTTGAAGAAG GTTTCGAAGG TGTTGATCCG CAACCGCTGC ACGAACGGAT TAATCAGGCG
CTGAGTTCGA TTGGCGCGCT GGTGCTTTCT GACTACGCCA AAGGTGCGCT GGCAAGCGTA
CAGCAGATGA TCCAACTGGC GCGTAAAGCG GGTGTTCCGG TGCTGATTGA TCCAAAAGGT
ACCGATTTTG AGCGCTACCG CGGCGCTACG CTGTTAACGC CGAATCTCTC GGAATTTGAA
GCTGTTGTCG GTAAATGTAA GACCGAAGAA GAGATTGTTG AGCGCGGCAT GAAACTAATT
GCCGATTACG AACTCTCGGC TCTGTTAGTG ACCCGTTCCG AACAGGGTAT GTCGCTGCTG
CAACCGGGTA AAGCGCCGCT GCATATGCCA ACCCAGGCGC AGGAAGTGTA TGACGTTACC
GGTGCGGGCG ACACGGTGAT TGGCGTCCTG GCGGCAACGC TGGCAGCGGG TAATTCGCTG
GAAGAAGCCT GCTTCTTTGC CAATGCGGCG GCTGGTGTGG TGGTCGGCAA ACTGGGGACA
TCCACGGTTT CGCCGATCGA GCTGGAAAAC GCAGTACGTG GACGTGCAGA TACCGGCTTT
GGCGTGATGA CCGAAGAGGA ACTGAAGCTT GCTGTAGCGG CAGCGCGTAA ACGCGGTGAA
AAAGTGGTGA TGACCAATGG CGTCTTTGAC ATCCTGCACG CCGGACACGT CTCTTATCTG
GCAAATGCCC GCAAACTGGG TGACCGTTTG ATTGTCGCCG TCAACAGCGA TGCCTCCACC
AAACGGCTGA AAGGGGATTC CCGCCCCGTT AACCCGCTTG AACAGCGTAT GATTGTGCTG
GGCGCACTGG AAGCGGTCGA CTGGGTGGTG TCGTTTGAAG AAGACACGCC GCAGCGCTTG
ATCGCCGGGA TCCTGCCAGA CCTGCTGGTG AAAGGCGGCG ATTATAAACC AGAAGAGATT
GCCGGGAGTA AAGAAGTCTG GGCCAATGGT GGCGAAGTGC TGGTGCTCAA CTTTGAAGAC
GGTTGCTCGA CCACTAACAT TATCAAGAAG ATCCAACAGG ATAAAAAAGG CTAA
 
Protein sequence
MKVTLPEFER AGVMVVGDVM LDRYWYGPTS RISPEAPVPV VKVNTIEERP GGAANVAMNI 
ASLGANARLV GLTGIDDAAR ALSKSLADVN VKCDFVSVPT HPTITKLRVL SRNQQLIRLD
FEEGFEGVDP QPLHERINQA LSSIGALVLS DYAKGALASV QQMIQLARKA GVPVLIDPKG
TDFERYRGAT LLTPNLSEFE AVVGKCKTEE EIVERGMKLI ADYELSALLV TRSEQGMSLL
QPGKAPLHMP TQAQEVYDVT GAGDTVIGVL AATLAAGNSL EEACFFANAA AGVVVGKLGT
STVSPIELEN AVRGRADTGF GVMTEEELKL AVAAARKRGE KVVMTNGVFD ILHAGHVSYL
ANARKLGDRL IVAVNSDAST KRLKGDSRPV NPLEQRMIVL GALEAVDWVV SFEEDTPQRL
IAGILPDLLV KGGDYKPEEI AGSKEVWANG GEVLVLNFED GCSTTNIIKK IQQDKKG
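
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. As a minimal sketch, the helpers below strip the block spacing and compute a GC percentage of the kind reported in the GC content field. The function names `clean_sequence` and `gc_percent` are hypothetical, and the rounding the database applies to its GC figure is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input for illustration only (not the gene sequence from this record).
example = "ATGC GGCC"
print(clean_sequence(example))
print(gc_percent(example))
```

To apply this to the record, paste the full gene sequence as a multi-line string and pass it to `gc_percent`; the result can then be compared with the listed GC content.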