Gene EcDH1_4087 details

Gene Information

Locus tag: EcDH1_4087
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4424615
End bp: 4426363
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: transcriptional antiterminator, BglG
Protein accession: ACX41687
Protein GI: 260451265
COG category:
COG ID:
TIGRFAM ID:
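
The start, end, gene length, and protein length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the reported 1749 bp is consistent with that reading) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: one codon per 3 bp, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4424615, 4426363)
print(length, "bp")                      # 1749 bp
print(protein_length_aa(length), "aa")   # 582 aa
```

Both results agree with the Gene Length and Protein Length fields in the record.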


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAAACG AACGCCAGTT AAAGATTGTC GATCTGCTGG AGCAACAGCC GCGCACGCCT 
GGCGAGCTGG CGCAACAGAC TGGCGTTTCA GGCAGGACCA TCCTGCGTGA TATTGACTAT
CTCAACTTCA CCCTTAACGG CAAAGCCCGC ATTTTCGCCA GTGGCAGTGC GGGCTATCAG
CTGGAAATCT TCGAGCGCCG CAGCTTTTTT CAGTTGCTGC AAAAGCACGA TAACGACGAT
CGGCTGCTGG CGCTGTTATT ACTGAATACT TTCACTCCCC GTGCGCAACT CGCCTCGGCG
CTTAATTTGC CAGAAACGTG GGTAGCAGAG CGTCTGCCCC GGTTAAAACA GCGTTATGAA
CGCACTTGTT GCCTGGCCAG CCGCCCTGGT TTGGGCCATT TCATTGATGA GACAGAAGAG
AAACGCGTTA TCTTGCTGGC GAACTTGCTG CGCAAAGATC CGTTTTTAAT TCCGCTGGCG
GGCATAACAC GAGACAACCT TCAGCATTTA TCCACGGCCT GCGACAACCA ACACCGCTGG
CCGCTCATGC AGGGTGATTA TCTCTCCAGC CTGATTCTGG CGATTTACGC CCTGCGTAAT
CAACTGACCG ATGAGTGGCC GCAATATCCC GGTGACGAGA TAAAACAAAT CGTTGAACAT
AGCGGTCTGT TTCTTGGTGA TAACGCTGTA AGAACCCTGA CGGGTTTGAT AGAGAAACAG
CATCAGCAAG CGCAGGTAAT TTCAGCCGAT AATGTGCAGG GGTTGCTGCA AAGGGTGCCG
GGCATCGCGT CATTGAATAT TATTGATGCG CAGCTGGTTG AGAATATTAC CGGGCATTTA
TTACGTTGCC TTGCCGCACC AGTGTGGATT GCTGAGCACC GCCAGAGCAG CATGAATAAC
CTGAAAGCCG CCTGGCCTGC GGCCTTTGAT ATGAGTCTGC ACTTTATTAC GCTACTGCGT
GAACAGCTCG ATATTCCCCT TTTCGACAGC GATCTGATCG GTTTGTATTT TGCCTGTGCG
CTGGAGCGGC ATCAAAACGA ACGCCAGCCG ATCATTTTGC TCTCGGACCA GAACGCGATT
GCCACTATTA ATCAGCTCGC CATTGAGCGC GATGTTTTAA ATTGTCGGGT AATTATTGCC
CGTAGCTTAA GCGAACTTGT TGCCATTCGC GAAGAGATTG AGCCGTTATT GATCATTAAC
AACAGCCATT ATTTACTGGA TGACGCGGTA AATAATTACA TCACCGTAAA AAATATCATT
ACGGCTGCCG GTATCGAACA AATAAAACAT TTTCTGGCGA CGGCATTTAT TCGCCAACAG
CCGGAGCGTT TTTTCTCTGC CCCCGGAAGT TTTCATTATT CGAATGTACG CGGTGAAAGC
TGGCAACATA TTACCCGGCA AATTTGTGCG CAATTAGTCG CACAACACCA TATTACCGCC
GATGAAGCAC AACGCATCAT CGCCCGCGAA GGCGAAGGTG AAAACCTGAT TGTTAATCGC
CTCGCCATCC CACATTGCTG GAGCGAACAG GAGCGACGTT TTCGTGGATT TTTTATTACC
CTCGCCCAAC CAGTTGAGGT GAATAACGAA GTCATTAACC ATGTCTTGAT CGCCTGCGCC
GCCGCCGATG CGCGTCATGA GCTAAAAATA TTTAGCTATC TGGCAAGCAT ATTGTGTCAG
CATCCGGCGG AGATTATTGC CGGGTTAACA GGATATGAGG CATTTATGGA GTTACTTCAC
AAGGGGTGA
 
Protein sequence
MLNERQLKIV DLLEQQPRTP GELAQQTGVS GRTILRDIDY LNFTLNGKAR IFASGSAGYQ 
LEIFERRSFF QLLQKHDNDD RLLALLLLNT FTPRAQLASA LNLPETWVAE RLPRLKQRYE
RTCCLASRPG LGHFIDETEE KRVILLANLL RKDPFLIPLA GITRDNLQHL STACDNQHRW
PLMQGDYLSS LILAIYALRN QLTDEWPQYP GDEIKQIVEH SGLFLGDNAV RTLTGLIEKQ
HQQAQVISAD NVQGLLQRVP GIASLNIIDA QLVENITGHL LRCLAAPVWI AEHRQSSMNN
LKAAWPAAFD MSLHFITLLR EQLDIPLFDS DLIGLYFACA LERHQNERQP IILLSDQNAI
ATINQLAIER DVLNCRVIIA RSLSELVAIR EEIEPLLIIN NSHYLLDDAV NNYITVKNII
TAAGIEQIKH FLATAFIRQQ PERFFSAPGS FHYSNVRGES WQHITRQICA QLVAQHHITA
DEAQRIIARE GEGENLIVNR LAIPHCWSEQ ERRFRGFFIT LAQPVEVNNE VINHVLIACA
AADARHELKI FSYLASILCQ HPAEIIAGLT GYEAFMELLH KG
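
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotide blocks codon by codon. The following is a minimal sketch in plain Python, not the tool that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as the standard genetic code (table 11 differs only in which codons may act as start codons; this gene begins with ATG, so that distinction does not matter here). The helper names `clean`, `translate`, and `gc_content` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGCTAAACG AACGCCAGTT AAAGATTGTC"
print(translate(first_blocks))  # MLNERQLKIV
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared with the Protein sequence block and the GC content field.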