Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2021 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2177531 |
End bp | 2179123 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | PTS system, maltose and glucose-specific IIBC subunit |
Protein accession | ACX39678 |
Protein GI | 260449256 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.271957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGA AAACAGCACC GAAAGTCACG CTGTGGGAGT TCTTCCAGCA GTTAGGCAAA ACCTTCATGT TACCCGTGGC ATTATTGTCG TTCTGCGGCA TTATGCTCGG CATTGGTAGT TCTCTTAGCA GCCATGATGT CATAACCCTG ATCCCGGTCC TGGGCAACCC CGTGTTGCAG GCTATCTTTA CCTGGATGAG TAAGATTGGC TCGTTTGCTT TTAGTTTCCT GCCTGTCATG TTCTGTATCG CCATCCCGCT GGGCCTGGCA CGCGAAAATA AAGGCGTAGC GGCATTCGCT GGCTTCATCG GTTATGCGGT AATGAACCTC GCGGTAAACT TCTGGTTGAC CAATAAAGGC ATTCTGCCAA CCACGGATGC CGCGGTTCTG AAAGCCAATA ACATCCAGAG CATTCTTGGG ATCCAGTCGA TCGATACCGG GATCCTCGGT GCGGTGATCG CCGGTATTAT CGTCTGGATG CTGCATGAGC GTTTCCATAA TATCCGCCTG CCGGATGCGC TGGCATTCTT CGGCGGTACG CGCTTCGTAC CAATTATCTC CTCGCTGGTG ATGGGCCTTG TCGGCCTGGT GATTCCATTA GTCTGGCCGA TTTTCGCCAT GGGTATTAGC GGCTTGGGCC ATATGATAAA CAGCGCGGGT GATTTCGGAC CGATGCTGTT TGGTACCGGT GAACGTCTGC TGTTGCCGTT TGGTCTGCAT CACATTCTGG TGGCATTAAT TCGCTTTACC GACGCAGGCG GCACGCAGGA AGTCTGCGGT CAAACCGTCA GCGGCGCACT GACCATCTTC CAGGCGCAAT TGAGTTGCCC GACCACTCAC GGTTTTTCTG AAAGCGCCAC GCGTTTCCTT TCGCAAGGTA AAATGCCTGC GTTTCTCGGC GGTCTGCCAG GTGCAGCGTT AGCTATGTAT CACTGCGCGC GCCCGGAAAA TCGCCATAAA ATTAAAGGTC TGCTGATTTC TGGCCTGATC GCCTGCGTCG TTGGCGGCAC TACCGAACCG CTGGAATTCC TGTTCCTGTT CGTAGCGCCA GTTCTGTATG TCATCCACGC GCTGTTAACC GGCCTCGGCT TCACCGTCAT GTCTGTGCTC GGCGTCACCA TCGGTAATAC CGACGGCAAT ATCATCGACT TCGTGGTGTT CGGTATTTTG CATGGTCTGT CAACCAAGTG GTACATGGTG CCAGTGGTGG CGGCAATCTG GTTTGTCGTT TACTACGTCA TCTTCCGTTT CGCTATCACC CGCTTCAATC TGAAAACCCC GGGGCGCGAT AGCGAAGTTG CCAGCTCAAT CGAAAAAGCC GTTGCCGGTG CGCCGGGTAA ATCAGGTTAC AACGTTCCTG CAATCCTCGA AGCATTAGGC GGTGCCGACA ATATTGTCAG CCTCGATAAC TGCATTACCC GTCTGCGTTT GTCTGTGAAA GATATGTCGC TTGTTAATGT GCAGGCACTG AAGGACAATC GGGCAATTGG CGTAGTACAA CTTAATCAAC ATAACCTGCA GGTTGTTATC GGGCCACAAG TTCAGTCAGT AAAAGATGAA ATGGCCGGTC TGATGCATAC TGTCCAGGCA TAA
|
Protein sequence | MTAKTAPKVT LWEFFQQLGK TFMLPVALLS FCGIMLGIGS SLSSHDVITL IPVLGNPVLQ AIFTWMSKIG SFAFSFLPVM FCIAIPLGLA RENKGVAAFA GFIGYAVMNL AVNFWLTNKG ILPTTDAAVL KANNIQSILG IQSIDTGILG AVIAGIIVWM LHERFHNIRL PDALAFFGGT RFVPIISSLV MGLVGLVIPL VWPIFAMGIS GLGHMINSAG DFGPMLFGTG ERLLLPFGLH HILVALIRFT DAGGTQEVCG QTVSGALTIF QAQLSCPTTH GFSESATRFL SQGKMPAFLG GLPGAALAMY HCARPENRHK IKGLLISGLI ACVVGGTTEP LEFLFLFVAP VLYVIHALLT GLGFTVMSVL GVTIGNTDGN IIDFVVFGIL HGLSTKWYMV PVVAAIWFVV YYVIFRFAIT RFNLKTPGRD SEVASSIEKA VAGAPGKSGY NVPAILEALG GADNIVSLDN CITRLRLSVK DMSLVNVQAL KDNRAIGVVQ LNQHNLQVVI GPQVQSVKDE MAGLMHTVQA
|
| |