Gene EcDH1_2021 details


Gene Information

Locus tag: EcDH1_2021
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2177531
End bp: 2179123
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: PTS system, maltose and glucose-specific IIBC subunit
Protein accession: ACX39678
Protein GI: 260449256
COG category:
COG ID:
TIGRFAM ID:
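
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2177531
end_bp = 2179123
reported_gene_length = 1593   # bp
reported_protein_length = 530  # aa

# Assumption: coordinates are 1-based and inclusive, so
# length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons. The last codon is the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length:", gene_length, "bp")
print("leftover bases after splitting into codons:", remainder)
print("protein length:", protein_length, "aa")
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

If the computed values disagreed with the record, the most likely explanations would be a different coordinate convention (for example 0-based, half-open) or a partial CDS.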


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.271957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGCGA AAACAGCACC GAAAGTCACG CTGTGGGAGT TCTTCCAGCA GTTAGGCAAA 
ACCTTCATGT TACCCGTGGC ATTATTGTCG TTCTGCGGCA TTATGCTCGG CATTGGTAGT
TCTCTTAGCA GCCATGATGT CATAACCCTG ATCCCGGTCC TGGGCAACCC CGTGTTGCAG
GCTATCTTTA CCTGGATGAG TAAGATTGGC TCGTTTGCTT TTAGTTTCCT GCCTGTCATG
TTCTGTATCG CCATCCCGCT GGGCCTGGCA CGCGAAAATA AAGGCGTAGC GGCATTCGCT
GGCTTCATCG GTTATGCGGT AATGAACCTC GCGGTAAACT TCTGGTTGAC CAATAAAGGC
ATTCTGCCAA CCACGGATGC CGCGGTTCTG AAAGCCAATA ACATCCAGAG CATTCTTGGG
ATCCAGTCGA TCGATACCGG GATCCTCGGT GCGGTGATCG CCGGTATTAT CGTCTGGATG
CTGCATGAGC GTTTCCATAA TATCCGCCTG CCGGATGCGC TGGCATTCTT CGGCGGTACG
CGCTTCGTAC CAATTATCTC CTCGCTGGTG ATGGGCCTTG TCGGCCTGGT GATTCCATTA
GTCTGGCCGA TTTTCGCCAT GGGTATTAGC GGCTTGGGCC ATATGATAAA CAGCGCGGGT
GATTTCGGAC CGATGCTGTT TGGTACCGGT GAACGTCTGC TGTTGCCGTT TGGTCTGCAT
CACATTCTGG TGGCATTAAT TCGCTTTACC GACGCAGGCG GCACGCAGGA AGTCTGCGGT
CAAACCGTCA GCGGCGCACT GACCATCTTC CAGGCGCAAT TGAGTTGCCC GACCACTCAC
GGTTTTTCTG AAAGCGCCAC GCGTTTCCTT TCGCAAGGTA AAATGCCTGC GTTTCTCGGC
GGTCTGCCAG GTGCAGCGTT AGCTATGTAT CACTGCGCGC GCCCGGAAAA TCGCCATAAA
ATTAAAGGTC TGCTGATTTC TGGCCTGATC GCCTGCGTCG TTGGCGGCAC TACCGAACCG
CTGGAATTCC TGTTCCTGTT CGTAGCGCCA GTTCTGTATG TCATCCACGC GCTGTTAACC
GGCCTCGGCT TCACCGTCAT GTCTGTGCTC GGCGTCACCA TCGGTAATAC CGACGGCAAT
ATCATCGACT TCGTGGTGTT CGGTATTTTG CATGGTCTGT CAACCAAGTG GTACATGGTG
CCAGTGGTGG CGGCAATCTG GTTTGTCGTT TACTACGTCA TCTTCCGTTT CGCTATCACC
CGCTTCAATC TGAAAACCCC GGGGCGCGAT AGCGAAGTTG CCAGCTCAAT CGAAAAAGCC
GTTGCCGGTG CGCCGGGTAA ATCAGGTTAC AACGTTCCTG CAATCCTCGA AGCATTAGGC
GGTGCCGACA ATATTGTCAG CCTCGATAAC TGCATTACCC GTCTGCGTTT GTCTGTGAAA
GATATGTCGC TTGTTAATGT GCAGGCACTG AAGGACAATC GGGCAATTGG CGTAGTACAA
CTTAATCAAC ATAACCTGCA GGTTGTTATC GGGCCACAAG TTCAGTCAGT AAAAGATGAA
ATGGCCGGTC TGATGCATAC TGTCCAGGCA TAA
 
Protein sequence
MTAKTAPKVT LWEFFQQLGK TFMLPVALLS FCGIMLGIGS SLSSHDVITL IPVLGNPVLQ 
AIFTWMSKIG SFAFSFLPVM FCIAIPLGLA RENKGVAAFA GFIGYAVMNL AVNFWLTNKG
ILPTTDAAVL KANNIQSILG IQSIDTGILG AVIAGIIVWM LHERFHNIRL PDALAFFGGT
RFVPIISSLV MGLVGLVIPL VWPIFAMGIS GLGHMINSAG DFGPMLFGTG ERLLLPFGLH
HILVALIRFT DAGGTQEVCG QTVSGALTIF QAQLSCPTTH GFSESATRFL SQGKMPAFLG
GLPGAALAMY HCARPENRHK IKGLLISGLI ACVVGGTTEP LEFLFLFVAP VLYVIHALLT
GLGFTVMSVL GVTIGNTDGN IIDFVVFGIL HGLSTKWYMV PVVAAIWFVV YYVIFRFAIT
RFNLKTPGRD SEVASSIEKA VAGAPGKSGY NVPAILEALG GADNIVSLDN CITRLRLSVK
DMSLVNVQAL KDNRAIGVVQ LNQHNLQVVI GPQVQSVKDE MAGLMHTVQA
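
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons). The example input is the first row of the gene sequence.

```python
# Build the codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()

def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First row of the gene sequence from this record.
first_row = clean_sequence(
    "ATGACGGCGA AAACAGCACC GAAAGTCACG CTGTGGGAGT TCTTCCAGCA GTTAGGCAAA"
)
print(translate(first_row))  # MTAKTAPKVTLWEFFQQLGK
print(round(gc_percent(first_row), 1))
```

Applying the same functions to the full gene sequence gives values that can be compared with the GC content and protein sequence reported in the record.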