Gene EcDH1_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0491 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp518576 
End bp520957 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content46% 
IMG OID 
Productfimbrial biogenesis outer membrane usher protein 
Protein accessionACX38179 
Protein GI260447757 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAA AAACGTTACT GGCCTACACC ATTGGTTTTG CCTTTTCTCC CCCAGCCAAT 
GCAGATGGTA TAGAGATTGC CGCTGTTGAT TTTGATCGGG AAACATTAAA ATCACTAGGT
GTAGATCCTA ATATATCGCA TTATTTTTCC CGTTCAGCCC GTTTTTTGCC AGGTGAATAT
TCACTGATAG TATCAGTAAA TGGCGAAAAA AAAGGCAACA TTGCTACGCG CTTTGATGAA
AATGGCGACA TTTGTCTTGA TCAGGCATTT CTGCAACAAG CCGGTTTAAA AATTCCTTCT
GAAGAAAAAA ATGGCTGTTA CGACTATATA TTGTCCTACC CGGGTACCAC AATCACACCA
TTACCTAACC AGGAAGCGTT AGATATTATC GTTTCACCAC AGGCGATCAT TCCCATAGGG
TTGGATCTCA CAAACGCAGC AACTGGTGGA ACAGCTGCGC TGCTAAACTA CTCTCTGATG
AGCAGCCGTG CAGAATTTTC TAATGGGAGT TCGGACTACT CCCAGGCTGC ACTTGAAGGC
GGGATTAATA TTAATGACTG GATGTTACGC AGCCATCAGT TCCTTACACA AACAAATGGC
ACATTCAGTA ACCAGAACTC GTCAACCTAC CTTCAACGTA CCTTTACAGA TCTTAAAACA
CTCATGCGAG CAGGTGAAGT TAACCTCAAT AATAGCGTGT TGGAAGGAGC CAGTATTTAC
GGTATCGAAA TCGCACCGGA CAACGCATTG CAAACCAGCG GCAGTGGTGT GCAAGTTACT
GGTATAGCCA ACACCTCTCA GGCTCGTGTC GAGATTCGTC AACAAGGAGT TTTAATTCAT
TCCATTCTGG TTCCTGCGGG CGCATTCACT ATCCCTGATG TACCTGTTCG CAATGGTAAT
AGTGATCTTA ATGTCACCGT TGTCGAAACA GACGGTAGTT CGCACAACTA TATTGTTCCC
TCCACCCTGT TTAATCAGCA TGTAGAAAGC TTCCAGGGTT ATCGCTTCGC GATAGGGCGG
GTAGACGATG ACTATGACGA ATCACCTTGG GTAATTAGTG CATCGAGCGG ATGGAATCTG
ACACGCTGGA GTGCAATGAA CGGCGGCGTT ATCGTAGCAG AAAATTATCA GGCGGCATCA
ATCCGGTCGA GTCTGGTTCC CCTGCCCGAT TTAACAGTGA GCAGCCAAAT TAGTACATCG
CAGGATACGA AAGACTCACT GCAAGGACAG AAATATCGTC TTGACGCGAA CTACAATCTC
CCATTTTCAC TTGGGCTAAC AACCAGCCTC ACTCGATCTG ATCGCCATTA TCGCGAACTG
TCTGAAGCGA TTGATGATGA TTATACCGAT CCGACTAAAA GCACTTATGC GCTTGGTTTA
AACTGGTCTA ACTCCATTCT GGGTGGTTTC AACATCAGTG GCTATAAAAC ATATAGTTAC
GACGGTGACA ATGACTCAAG CAACCTTAAT ATTAACTGGA ACAAAGCGTT CAAACACGCC
ACGGTTTCCG TCAACTGGCA GCATCAACTT AGTGCTTCAG AAAATAATGA AGACGATGGT
GATCTGTTCT ACGTCAACAT CAGTATTCCA TTTGGCAGAT CAAACACCGC CACACTGTAT
ACTCGCCATG ACGATCATAA AACCCACTAT GGTACTGGTG TCATGGGAGT CGTCTCAGAT
GAGATGTCCT ACTATGTGAA TGCTGAACGA GATCACGACG AACGTGAAAC GAGCTTGAAC
GGCAGTATCA GTTCCAATCT CCATTACACC CAAGTCAGCC TTGCCGCAGG AGCAAGCGGC
AGTGATAGCC GTACTTACAA CGGTACGATG TCAGGTGGTA TTGCCGTACA TGATCAGGGA
GTGACCTTTT CACCGTGGAC TATCAATGAC ACTTTCGCCA TCGCAAAAAT GGATAACAAT
ATTGCAGGTG TCAGAATTAC ATCTCAGGCA GGCCCAGTAT GGACAGATTT TCGGGGTAAT
GCCGTCATTC CATCAATCCA GCCGTGGCGA ACATCAGGAG TTGAGATCGA TACCGCCAGC
TTGCCAAAAA ATGTCGATAT CGGTAACGGC ACAAAAATGA TCAAACAAGG CCGTGGTGCA
GTAGGGAAAG TCGGATTCAG TGCGATAACA CAACGCCGTG CATTACTCAA TATCACACTT
TCCGACGGCA AAAAACTGCC CAGAGGCGTT GCGATTGAAG ATAGTGAAGG CAACTATCTG
ACAACATCAG TGGATGACGG TGTTGTATTC CTCAATAACA TCAAACCGGA CATGGTGCTA
GATATAAAAG ATGAGCAGCA ATCATGCCGT ATTCACCTTA CATTCCCAGA AGATGCACCA
AAAGATGTGT TCTATGAGAC AGCAACAGGA GAGTGCCAAT GA
 
Protein sequence
MLKKTLLAYT IGFAFSPPAN ADGIEIAAVD FDRETLKSLG VDPNISHYFS RSARFLPGEY 
SLIVSVNGEK KGNIATRFDE NGDICLDQAF LQQAGLKIPS EEKNGCYDYI LSYPGTTITP
LPNQEALDII VSPQAIIPIG LDLTNAATGG TAALLNYSLM SSRAEFSNGS SDYSQAALEG
GININDWMLR SHQFLTQTNG TFSNQNSSTY LQRTFTDLKT LMRAGEVNLN NSVLEGASIY
GIEIAPDNAL QTSGSGVQVT GIANTSQARV EIRQQGVLIH SILVPAGAFT IPDVPVRNGN
SDLNVTVVET DGSSHNYIVP STLFNQHVES FQGYRFAIGR VDDDYDESPW VISASSGWNL
TRWSAMNGGV IVAENYQAAS IRSSLVPLPD LTVSSQISTS QDTKDSLQGQ KYRLDANYNL
PFSLGLTTSL TRSDRHYREL SEAIDDDYTD PTKSTYALGL NWSNSILGGF NISGYKTYSY
DGDNDSSNLN INWNKAFKHA TVSVNWQHQL SASENNEDDG DLFYVNISIP FGRSNTATLY
TRHDDHKTHY GTGVMGVVSD EMSYYVNAER DHDERETSLN GSISSNLHYT QVSLAAGASG
SDSRTYNGTM SGGIAVHDQG VTFSPWTIND TFAIAKMDNN IAGVRITSQA GPVWTDFRGN
AVIPSIQPWR TSGVEIDTAS LPKNVDIGNG TKMIKQGRGA VGKVGFSAIT QRRALLNITL
SDGKKLPRGV AIEDSEGNYL TTSVDDGVVF LNNIKPDMVL DIKDEQQSCR IHLTFPEDAP
KDVFYETATG ECQ