Gene EcDH1_1548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1548 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1685949 
End bp1688429 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content46% 
IMG OID 
Productfimbrial biogenesis outer membrane usher protein 
Protein accessionACX39216 
Protein GI260448794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.2778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAGAA TGACCCCACT TGCATCAGCA ATCGTAGCGT TATTGCTCGG CATTGAAGCT 
TATGCAGCTG AAGAAACCTT TGATACCCAT TTTATGATAG GTGGAATGAA AGACCAGCAG
GTTGCAAATA TTCGTCTTGA TGATAATCAA CCCTTACCGG GGCAGTATGA CATCGATATT
TATGTCAATA AGCAATGGCG CGGGAAATAT GAGATTATTG TTAAAGACAA CCCGCAAGAA
ACATGTTTAT CAAGAGAAGT TATCAAGCGG TTAGGCATTA ATAGCGATAA CTTCGCCAGC
GGTAAGCAAT GTTTAACATT TGAGCAACTT GTTCAGGGTG GGAGCTATAC CTGGGATATC
GGGGTTTTTC GTCTCGATTT CAGTGTCCCG CAGGCCTGGG TGGAAGAACT GGAAAGTGGC
TATGTTCCAC CGGAAAACTG GGAGCGGGGT ATTAATGCGT TTTATACCTC TTATTATCTG
AGTCAGTATT ACAGCGACTA TAAAGCGTCG GGTAATAACA AGAGTACATA TGTACGTTTT
AACAGCGGGT TAAATTTACT GGGGTGGCAA CTGCATTCTG ATGCCAGTTT CAGTAAAACA
AATAACAATC CAGGGGTGTG GAAAAGCAAT ACCCTGTATC TGGAACGTGG ATTTGCCCAA
CTTCTCGGCA CGCTTCGCGT GGGTGATATG TACACATCAA GCGATATTTT TGATTCTGTT
CGCTTCAGAG GTGTGCGGTT GTTTCGTGAT ATGCAGATGT TGCCTAACTC GAAACAAAAT
TTTACGCCAC GGGTGCAGGG GATTGCTCAG AGTAACGCGC TGGTAACTAT TGAACAGAAT
GGTTTTGTGG TTTATCAGAA AGAGGTTCCT CCTGGCCCGT TCGCGATTAC AGATTTGCAG
TTGGCCGGTG GTGGAGCAGA TCTTGATGTC AGCGTGAAAG AGGCGGACGG CTCGGTAACC
ACCTATCTGG TGCCTTATGC AGCGGTGCCA AATATGCTGC AACCCGGCGT GTCGAAATAT
GATTTAGCGG CGGGTCGTAG CCATATTGAA GGGGCGAGCA AACAAAGTGA TTTTGTCCAG
GCGGGTTATC AGTATGGTTT TAATAATTTA TTGACGCTGT ATGGTGGCTC GATGGTCGCG
AATAATTATT ACGCGTTTAC TTTGGGGGCT GGCTGGAATA CACGCATTGG TGCCATTTCC
GTCGATGCCA CTAAGTCGCA TAGTAAACAA GACAACGGCG ATGTGTTTGA CGGGCAAAGT
TATCAAATTG CCTACAACAA ATTTGTGAGC CAAACGTCGA CGCGTTTTGG TCTGGCGGCC
TGGCGTTATT CGTCGCGTGA TTACCGGACA TTTAACGATC ACGTTTGGGC AAACAATAAA
GATAATTATC GCCGTGATGA AAACGATGTC TATGACATTG CCGATTATTA CCAGAACGAT
TTTGGCCGCA AAAATAGCTT TTCCGCCAAT ATGAGCCAGT CATTGCCAGA AGGTTGGGGG
TCTGTGTCAT TAAGTACGTT ATGGCGAGAT TACTGGGGGC GTAGCGGCAG TAGTAAGGAT
TATCAGTTGA GTTATTCCAA CAACCTGCGA CGGATAAGCT ATACCCTCGC GGCAAGCCAG
GCTTATGACG AGAATCATCA TGAAGAGAAA CGTTTTAATA TTTTTATATC GATTCCCTTT
GATTGGGGTG ATGACGTTTC GACGCCTCGT CGGCAAATAT ATATGTCTAA CTCAACGACG
TTTGATGATC AGGGGTTTGC CTCAAATAAT ACGGGATTAT CAGGAACAGT AGGGAGTCGG
GATCAGTTCA ATTATGGTGT CAACCTGAGT CATCAACATC AGGGAAATGA AACGACAGCT
GGGGCGAATT TGACCTGGAA CGCGCCGGTT GCGACAGTGA ATGGCAGTTA TAGTCAGTCG
AGTACTTATC GACAGGCTGG AGCCAGTGTT TCAGGGGGCA TTGTCGCCTG GTCGGGTGGC
GTTAATCTGG CGAACCGTCT TTCCGAAACG TTTGCTGTGA TGAATGCGCC AGGAATTAAA
GATGCTTATG TCAATGGGCA AAAATATCGC ACAACAAACC GTAATGGAGT GGTGATATAC
GACGGAATGA CACCTTATCG GGAAAATCAC CTGATGCTGG ATGTGTCGCA AAGCGATAGC
GAAGCAGAAT TACGTGGCAA CCGGAAAATT GCCGCCCCTT ATCGCGGCGC GGTTGTACTG
GTTAATTTTG ATACCGATCA GCGCAAGCCA TGGTTTATAA AAGCGTTAAG AGCAGATGGG
CAATCATTAA CGTTTGGTTA TGAAGTCAAT GATATCCATG GTCATAATAT TGGCGTTGTC
GGCCAGGGAA GTCAGTTATT TATTCGCACC AATGAAGTAC CGCCATCGGT TAATGTGGCA
ATTGATAAGC AACAAGGACT TTCATGCACA ATCACCTTCG GTAAAGAGAT TGATGAAAGT
AGAAATTATA TTTGCCAGTA A
 
Protein sequence
MLRMTPLASA IVALLLGIEA YAAEETFDTH FMIGGMKDQQ VANIRLDDNQ PLPGQYDIDI 
YVNKQWRGKY EIIVKDNPQE TCLSREVIKR LGINSDNFAS GKQCLTFEQL VQGGSYTWDI
GVFRLDFSVP QAWVEELESG YVPPENWERG INAFYTSYYL SQYYSDYKAS GNNKSTYVRF
NSGLNLLGWQ LHSDASFSKT NNNPGVWKSN TLYLERGFAQ LLGTLRVGDM YTSSDIFDSV
RFRGVRLFRD MQMLPNSKQN FTPRVQGIAQ SNALVTIEQN GFVVYQKEVP PGPFAITDLQ
LAGGGADLDV SVKEADGSVT TYLVPYAAVP NMLQPGVSKY DLAAGRSHIE GASKQSDFVQ
AGYQYGFNNL LTLYGGSMVA NNYYAFTLGA GWNTRIGAIS VDATKSHSKQ DNGDVFDGQS
YQIAYNKFVS QTSTRFGLAA WRYSSRDYRT FNDHVWANNK DNYRRDENDV YDIADYYQND
FGRKNSFSAN MSQSLPEGWG SVSLSTLWRD YWGRSGSSKD YQLSYSNNLR RISYTLAASQ
AYDENHHEEK RFNIFISIPF DWGDDVSTPR RQIYMSNSTT FDDQGFASNN TGLSGTVGSR
DQFNYGVNLS HQHQGNETTA GANLTWNAPV ATVNGSYSQS STYRQAGASV SGGIVAWSGG
VNLANRLSET FAVMNAPGIK DAYVNGQKYR TTNRNGVVIY DGMTPYRENH LMLDVSQSDS
EAELRGNRKI AAPYRGAVVL VNFDTDQRKP WFIKALRADG QSLTFGYEVN DIHGHNIGVV
GQGSQLFIRT NEVPPSVNVA IDKQQGLSCT ITFGKEIDES RNYICQ