Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_4087 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4424615 |
End bp | 4426363 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | transcriptional antiterminator, BglG |
Protein accession | ACX41687 |
Protein GI | 260451265 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGCAGTGC GGGCTATCAG CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT CGGCTGCTGG CGCTGTTATT ACTGAATACT TTCACTCCCC GTGCGCAACT CGCCTCGGCG CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA CGCACTTGTT GCCTGGCCAG CCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG AAACGCGTTA TCTTGCTGGC GAACTTGCTG CGCAAAGATC CGTTTTTAAT TCCGCTGGCG GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG CCGCTCATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAT AGCGGTCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CGGGTTTGAT AGAGAAACAG CATCAGCAAG CGCAGGTAAT TTCAGCCGAT AATGTGCAGG GGTTGCTGCA AAGGGTGCCG GGCATCGCGT CATTGAATAT TATTGATGCG CAGCTGGTTG AGAATATTAC CGGGCATTTA TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCACC GCCAGAGCAG CATGAATAAC CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG CTGGAGCGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT GCCACTATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAA ATTGTCGGGT AATTATTGCC CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAACAG CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTCG CACAACACCA TATTACCGCC GATGAAGCAC AACGCATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC CTCGCCCAAC CAGTTGAGGT GAATAACGAA GTCATTAACC ATGTCTTGAT CGCCTGCGCC GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCAT ATTGTGTCAG CATCCGGCGG AGATTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC AAGGGGTGA
|
Protein sequence | MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEH SGLFLGDNAV RTLTGLIEKQ HQQAQVISAD NVQGLLQRVP GIASLNIIDA QLVENITGHL LRCLAAPVWI AEHRQSSMNN LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI ATINQLAIER DVLNCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VINHVLIACA AADARHELKI FSYLASILCQ HPAEIIAGLT GYEAFMELLH KG
|
| |