Gene EcDH1_2918 details


Gene Information

Locus tag: EcDH1_2918
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3128941
End bp: 3131397
Gene Length: 2457 bp
Protein Length: 818 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: fimbrial biogenesis outer membrane usher protein
Protein accession: ACX40551
Protein GI: 260450129
COG category:
COG ID:
TIGRFAM ID:
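
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3128941
end_bp = 3131397

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2457 0 818
```

Under these assumptions the computed values agree with the Gene Length (2457 bp) and Protein Length (818 aa) fields.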


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.528881
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACCG TGAATATTTA TCGACTCTCT TTTGTATCCT GCCTGGTCAT GGCGATGCCT 
TGCGCAATGG CGGTCGAATT CAATCTGAAT GTTCTCGATA AATCAATGCG CGACCGCATT
GATATTTCAT TATTAAAGGA AAAAGGAGTC ATTGCTCCCG GTGAATATTT TGTTAGCGTT
GCGGTGAATA ACAACAAAAT CAGTAATGGG CAAAAAATTA ACTGGCAAAA AAAGGGTGAC
AAAACCATTC CATGCATCAA TGATTCACTG GTCGATAAAT TTGGTTTAAA ACCAGATATC
CGTCAGTCCT TGCCACAGAT AGATCGGTGT ATTGATTTCA GTTCCCGACC TGAAATGCTC
TTCAATTTCG ATCAAGCCAA TCAGCAACTG AATATTAGTA TTCCGCAAGC CTGGCTGGCG
TGGCACTCAG AAAACTGGGC TCCCCCCTCT ACATGGAAAG AAGGTGTTGC CGGTGTCCTG
ATGGATTACA ACTTGTTTGC CAGCAGCTAC CGCCCACAGG ACGGCAGCAG CAGCACTAAC
CTGAATGCCT ACGGTACCGC CGGAATTAAC GCCGGGGCAT GGCGCTTACG CAGTGATTAC
CAGCTTAATA AGACCGATAG CGAAGATAAC CATGACCAGT CAGGCGGAAT ATCGCGCACC
TATCTTTTTC GTCCATTACC GCAATTAGGC TCTAAGTTAA CCCTCGGCGA AACCGATTTC
AGTTCCAATA TTTTCGATGG TTTTTCTTAT ACCGGCGCGG CACTGGCGAG TGACGATCGA
ATGTTACCGT GGGAGCTGCG TGGCTACGCC CCACAAATTA GCGGTATTGC ACAGACCAAT
GCCACGGTGA CGATCAGTCA ATCAGGCCGC GTCATTTACC AGAAAAAAGT CCCGCCAGGC
CCGTTTATTA TTGATGACCT CAATCAGTCT GTTCAGGGCA CGCTGGATGT CAAAGTGACG
GAAGAAGATG GTCGGGTGAA CAATTTCCAG GTTTCGGCAG CATCGACGCC CTTCCTGACT
CGCCAGGGAC AGGTTCGCTA TAAATTGGCC GCGGGTCAGC CACGGCCTTC CATGTCACAT
CAAACTGAAA ATGAAACCTT TTTTAGCAAT GAAGTTTCCT GGGGGATGCT CTCAAACACC
TCGCTGTACG GCGGCCTGCT GATTTCTGAT GATGACTACC ATTCTGCCGC AATGGGTATC
GGGCAAAATA TGCTGTGGCT TGGCGCACTG TCCTTTGATG TCACCTGGGC CAGTAGCCAT
TTTGATACTC AGCAGGACGA GCGGGGCTTA AGCTACCGTT TTAATTACAG CAAACAAGTG
GATGCCACCA ACAGCACGAT TTCGCTCGCC GCTTATCGCT TCTCAGATCG TCATTTTCAC
AGCTACGCCA ACTATCTGGA TCACAAATAC AACGACAGCG ATGCGCAGGA CGAAAAACAG
ACGATCAGCT TATCCGTGGG CCAACCGATT ACCCCACTAA ACCTCAATCT TTACGCCAAC
CTGCTACATC AAACCTGGTG GAATGCAGAC GCCTCCACGA CCGCCAACAT CACAGCAGGT
TTTAATGTTG ATATTGGTGA CTGGAGAGAT ATCTCGATTT CGACGTCATT CAATACGACC
CACTACGAAG ATAAAGATCG CGACAACCAG ATTTATCTGT CGATTTCGCT CCCCTTCGGT
AACGGTGGTC GGGTTGGCTA TGACATGCAA AACAGTAGCC ACAGCACCAT ACACCGCATG
TCGTGGAACG ATACGCTGGA TGAACGTAAT AGCTGGGGCA TGTCTGCCGG ACTGCAATCC
GATCGTCCGG ACAATGGAGC CCAGGTGAGC GGTAACTATC AGCACCTGAG TTCAGCGGGT
GAGTGGGATA TTTCTGGTAC CTATGCCGCC AGTGATTACA GTTCCGTCAG CAGCAGCTGG
AGCGGTTCTT TCACCGCAAC CCAATATGGT GCAGCATTTC ATCGCCGCAG CTCCACCAAT
GAACCACGCC TGATGGTCAG CACCGATGGC GTGGCAGATA TTCCGGTTCA GGGCAATCTC
GACTACACCA ACCATTTTGG CATTGCGGTG GTGCCGTTGA TTTCCAGTTA CCAGCCTTCC
ACCGTGGCGG TGAACATGAA TGACTTACCC GACGGCGTAA CAGTTGCAGA AAACGTCATC
AAGGAAACAT GGATTGAAGG CGCGATAGGT TACAAATCAC TGGCTTCCCG TTCCGGTAAA
GACGTTAACG TCATCATACG CAACGCCAGC GGTCAGTTCC CTCCCCTCGG TGCGGATATC
CGCCAGGATG ACAGCGGCAT TAGCGTGGGT ATGGTTGGCG AGGAAGGACA TGCCTGGTTA
AGCGGTGTCG CTGAAAATCA ACTGTTTACC GTGGTCTGGG GTGAGCAAAG CTGCATTATT
CATCTGCCAG AACGTCTGGA AGACACGACC AAACGCCTGA TTTTACCTTG TCATTAA
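
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the block layout could be flattened and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input. How the record's rounded "GC content" value was derived is not stated, so this is not claimed to reproduce it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGGACACCG TGAATATTTA TCGACTCTCT TTTGTATCCT GCCTGGTCAT GGCGATGCCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To process the whole gene, the full multi-line block can be passed to `clean_sequence` in the same way.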
 
Protein sequence
MDTVNIYRLS FVSCLVMAMP CAMAVEFNLN VLDKSMRDRI DISLLKEKGV IAPGEYFVSV 
AVNNNKISNG QKINWQKKGD KTIPCINDSL VDKFGLKPDI RQSLPQIDRC IDFSSRPEML
FNFDQANQQL NISIPQAWLA WHSENWAPPS TWKEGVAGVL MDYNLFASSY RPQDGSSSTN
LNAYGTAGIN AGAWRLRSDY QLNKTDSEDN HDQSGGISRT YLFRPLPQLG SKLTLGETDF
SSNIFDGFSY TGAALASDDR MLPWELRGYA PQISGIAQTN ATVTISQSGR VIYQKKVPPG
PFIIDDLNQS VQGTLDVKVT EEDGRVNNFQ VSAASTPFLT RQGQVRYKLA AGQPRPSMSH
QTENETFFSN EVSWGMLSNT SLYGGLLISD DDYHSAAMGI GQNMLWLGAL SFDVTWASSH
FDTQQDERGL SYRFNYSKQV DATNSTISLA AYRFSDRHFH SYANYLDHKY NDSDAQDEKQ
TISLSVGQPI TPLNLNLYAN LLHQTWWNAD ASTTANITAG FNVDIGDWRD ISISTSFNTT
HYEDKDRDNQ IYLSISLPFG NGGRVGYDMQ NSSHSTIHRM SWNDTLDERN SWGMSAGLQS
DRPDNGAQVS GNYQHLSSAG EWDISGTYAA SDYSSVSSSW SGSFTATQYG AAFHRRSSTN
EPRLMVSTDG VADIPVQGNL DYTNHFGIAV VPLISSYQPS TVAVNMNDLP DGVTVAENVI
KETWIEGAIG YKSLASRSGK DVNVIIRNAS GQFPPLGADI RQDDSGISVG MVGEEGHAWL
SGVAENQLFT VVWGEQSCII HLPERLEDTT KRLILPCH
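
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. As a rough illustration, the sketch below builds a codon table from the compact NCBI amino-acid string (bases ordered T, C, A, G) and translates until the first stop codon. Table 11 assigns sense codons the same amino acids as the standard code and differs in which start codons it permits, which this sketch does not model. The function name `translate` is hypothetical; this is not the tool used to generate the record.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its last 15 bases.
print(translate("ATGGACACCG TGAATATTTA TCGACTCTCT"))  # prints: MDTVNIYRLS
print(translate("TTACCTTGTCATTAA"))                   # prints: LPCH
```

Both outputs match the start (`MDTVNIYRLS`) and end (`LPCH`) of the protein sequence listed above.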