Gene EcDH1_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2958 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3175229 
End bp3177175 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content55% 
IMG OID 
ProductPTS system, N-acetylglucosamine-specific IIBC subunit 
Protein accessionACX40587 
Protein GI260450165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT TAGGTTTTTT CCAGCGACTC GGTAGGGCGT TACAGCTCCC TATCGCGGTG 
CTGCCGGTGG CGGCACTGTT GCTGCGATTC GGTCAGCCAG ATTTACTTAA CGTTGCGTTT
ATTGCCCAGG CGGGCGGTGC GATTTTTGAT AACCTCGCAT TAATCTTCGC CATCGGTGTG
GCATCCAGCT GGTCGAAAGA CAGCGCTGGT GCGGCGGCGC TGGCGGGTGC GGTAGGTTAC
TTTGTGTTAA CCAAAGCGAT GGTGACCATC AACCCAGAAA TTAACATGGG TGTACTGGCG
GGTATCATTA CCGGTCTGGT TGGTGGCGCA GCCTATAACC GTTGGTCCGA TATTAAACTG
CCGGACTTCC TGAGCTTCTT CGGCGGCAAA CGCTTTGTGC CGATTGCCAC CGGATTCTTC
TGCCTGGTGC TGGCGGCCAT TTTTGGTTAC GTCTGGCCGC CGGTACAGCA CGCTATCCAT
GCAGGCGGCG AGTGGATCGT TTCTGCGGGC GCGCTGGGTT CCGGTATCTT TGGTTTCATC
AACCGTCTGC TGATCCCAAC CGGTCTGCAT CAGGTACTGA ACACCATCGC CTGGTTCCAG
ATTGGTGAAT TCACCAACGC GGCGGGTACG GTTTTCCACG GTGACATTAA CCGCTTCTAT
GCCGGTGACG GCACCGCGGG GATGTTCATG TCCGGCTTCT TCCCGATCAT GATGTTCGGT
CTGCCGGGTG CGGCGCTGGC GATGTACTTC GCAGCACCGA AAGAGCGTCG TCCGATGGTT
GGCGGTATGC TGCTTTCTGT TGCTGTTACT GCGTTCCTGA CCGGTGTGAC TGAGCCGCTG
GAATTCCTGT TCATGTTCCT TGCTCCGCTG CTGTACCTCC TGCACGCACT GCTGACCGGT
ATCAGCCTGT TTGTGGCAAC GCTGCTGGGT ATCCACGCGG GCTTCTCTTT CTCTGCGGGG
GCTATCGACT ACGCGTTGAT GTATAACCTG CCGGCCGCCA GCCAGAACGT CTGGATGCTG
CTGGTGATGG GCGTTATCTT CTTCGCTATC TACTTCGTGG TGTTCAGTTT GGTTATCCGC
ATGTTCAACC TGAAAACGCC GGGTCGTGAA GATAAAGAAG ACGAGATCGT TACTGAAGAA
GCCAACAGCA ACACTGAAGA AGGTCTGACT CAACTGGCAA CCAACTATAT TGCTGCGGTT
GGCGGCACTG ACAACCTGAA AGCGATTGAC GCCTGTATCA CCCGTCTGCG CCTTACAGTG
GCTGACTCTG CCCGCGTTAA CGATACGATG TGTAAACGTC TGGGTGCTTC TGGGGTAGTG
AAACTGAACA AACAGACTAT TCAGGTGATT GTTGGCGCGA AAGCAGAATC CATCGGCGAT
GCGATGAAGA AAGTCGTTGC CCGTGGTCCG GTAGCCGCTG CGTCAGCTGA AGCAACTCCG
GCAACTGCCG CGCCTGTAGC AAAACCGCAG GCTGTACCAA ACGCGGTATC TATCGCGGAG
CTGGTATCGC CGATTACCGG TGATGTCGTG GCACTGGATC AGGTTCCTGA CGAAGCATTC
GCCAGCAAAG CGGTGGGTGA CGGTGTGGCG GTGAAACCGA CAGATAAAAT CGTCGTATCA
CCAGCCGCAG GGACAATCGT GAAAATCTTC AACACCAACC ACGCGTTCTG CCTGGAAACC
GAAAAAGGCG CGGAGATCGT CGTCCATATG GGTATCGACA CCGTAGCGCT GGAAGGTAAA
GGCTTTAAAC GTCTGGTGGA AGAGGGTGCG CAGGTAAGCG CAGGGCAACC GATTCTGGAA
ATGGATCTGG ATTACCTGAA CGCTAACGCC CGCTCGATGA TTAGCCCGGT GGTTTGCAGC
AATATCGACG ATTTCAGTGG CTTGATCATT AAAGCTCAGG GCCATATTGT GGCGGGTCAA
ACACCGCTGT ATGAAATCAA AAAGTAA
 
Protein sequence
MNILGFFQRL GRALQLPIAV LPVAALLLRF GQPDLLNVAF IAQAGGAIFD NLALIFAIGV 
ASSWSKDSAG AAALAGAVGY FVLTKAMVTI NPEINMGVLA GIITGLVGGA AYNRWSDIKL
PDFLSFFGGK RFVPIATGFF CLVLAAIFGY VWPPVQHAIH AGGEWIVSAG ALGSGIFGFI
NRLLIPTGLH QVLNTIAWFQ IGEFTNAAGT VFHGDINRFY AGDGTAGMFM SGFFPIMMFG
LPGAALAMYF AAPKERRPMV GGMLLSVAVT AFLTGVTEPL EFLFMFLAPL LYLLHALLTG
ISLFVATLLG IHAGFSFSAG AIDYALMYNL PAASQNVWML LVMGVIFFAI YFVVFSLVIR
MFNLKTPGRE DKEDEIVTEE ANSNTEEGLT QLATNYIAAV GGTDNLKAID ACITRLRLTV
ADSARVNDTM CKRLGASGVV KLNKQTIQVI VGAKAESIGD AMKKVVARGP VAAASAEATP
ATAAPVAKPQ AVPNAVSIAE LVSPITGDVV ALDQVPDEAF ASKAVGDGVA VKPTDKIVVS
PAAGTIVKIF NTNHAFCLET EKGAEIVVHM GIDTVALEGK GFKRLVEEGA QVSAGQPILE
MDLDYLNANA RSMISPVVCS NIDDFSGLII KAQGHIVAGQ TPLYEIKK