Gene EcDH1_0654 details

Gene Information

Locus tag: EcDH1_0654
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 693898
End bp: 696363
Gene Length: 2466 bp
Protein Length: 821 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: fimbrial biogenesis outer membrane usher protein
Protein accession: ACX38340
Protein GI: 260447918
COG category:
COG ID:
TIGRFAM ID:
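
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(693898, 696363)
print(length, protein_length(length))  # 2466 821
```

Under these assumptions the computed values match the Gene Length (2466 bp) and Protein Length (821 aa) fields of this record.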


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGGAA ATATAGGGGC AAATCCAGTT ATCATCATTG GTTGTGCGTC AGCTTATGCC 
GTTGAATTCA ACAAAGATTT AATCGAAGCC GAAGATCGTG AAAACGTTAA CCTTTCCCAA
TTTGAAACTG ATGGCCAATT ACCCGTCGGC AAATATTCAC TAAGCACTCT GATTAATAAT
AAGAGGACGC CAATCCACCT TGACCTCCAA TGGGTATTAA TTGATAACCA AACTGCAGTT
TGCGTGACAC CAGAGCAATT AACATTATTA GGATTTACTG ATGAATTTAT TGAAAAAACT
CAGCAAAACC TGATCGATGG TTGTTACCCT ATCGAAAAAG AAAAACAAAT TACAACTTAT
CTCGATAAAG GGAAAATGCA ATTATCCATA TCTGCACCTC AGGCATGGTT AAAATACAAA
GATGCAAACT GGACGCCTCC TGAACTTTGG AATCATGGTA TTGCTGGGGC ATTTCTTGAC
TACAATTTAT ATGCCTCTCA TTATGCACCA CATCAGGGCG ATAATTCGCA AAATATAAGT
TCCTATGGGC AGGCTGGGGT TAATCTTGGG GCCTGGCGCC TGCGTACTGA TTACCAGTAC
GATCAGTCAT TTAACAATGG CAAAAGCCAG GCGACCAACC TGGATTTTCC GCGTATTTAT
TTGTTTCGCC CAATCCCAGC AATGAATGCA AAACTAACTA TAGGTCAATA CGATACTGAA
TCCTCTATTT TCGACTCTTT CCATTTTTCT GGCATTTCGT TGAAAAGCGA TGAGAATATG
TTACCGCCAG ACCTACGTGG TTACGCACCG CAAATCACGG GTGTCGCACA AACGAATGCA
AAGGTCACTG TCTCACAGAA CAACCGTATT ATTTATCAAG AAAATGTTCC TCCAGGCCCA
TTTGCTATTA CCAATTTATT CAATACATTA CAGGGGCAAC TTGACGTCAA GGTTGAAGAA
GAGGACGGAC GCGTTACGCA ATGGCAAGTT GCATCTAATA GTATTCCTTA TCTGACGCGT
AAAGGGCAGA TTCGCTACAC CACTGCTATG GGTAAACCGA CCAGCGTTGG TGGTGATTCC
TTACAACAAC CCTTCTTCTG GACTGGTGAA TTCTCATGGG GTTGGCTGAA CAATGTATCC
CTGTATGGTG GTTCAGTTTT AACAAACCGT GATTATCAAT CTCTGGCTGC CGGCGTTGGT
TTTAATCTTA ACTCATTAGG TTCATTATCT TTTGATGTCA CACGATCTGA TGCTCAGTTG
CATAATCAGG ATAAAGAAAC GGGTTATAGC TACCGCGCTA ACTATTCAAA ACGTTTTGAA
TCTACCGGTA GCCAGCTCAC TTTCGCTGGT TACCGTTTCT CTGATAAAAA CTTTGTGACA
ATGAATGAAT ATATCAATGA CACTAACCAT TACACGAATT ATCAGAATGA AAAAGAGAGT
TATATTGTCA CGTTTAACCA GTATCTTGAA TCATTAAGGT TAAATACATA CGTAAGTTTG
GCTCGTAATA CTTACTGGGA CGCCAGCAGT AATGTGAATT ATTCATTATC ACTTAGCCGC
GATTTTGATA TCGGGCCATT AAAAAACGTC TCCACTTCTC TAACATTTAG CCGAATAAAC
TGGGAAGAAG ACAACCAGGA TCAACTGTAC CTAAATATTT CGATTCCCTG GGGAACTAGT
AGAACATTGA GCTATGGTAT GCAACGAAAT CAGGATAATG AGATTTCGCA TACTGCTTCG
TGGTATGACT CTTCCGATCG AAATAATTCC TGGAGCGTTT CTGCTTCAGG CGACAATGAT
GAATTCAAAG ATATGAAAGC GTCACTACGC GCCAGTTATC AGCATAATAC CGAGAACGGT
CGACTCTACC TCTCCGGTAC ATCACAGCGA GACAGTTATT ATTCTCTGAA TGCCAGTTGG
AATGGTTCAT TCACTGCGAC TCGCCACGGT GCCGCTTTCC ACGACTATAG CGGTAGTGCT
GACTCGCGTT TTATGATCGA CGCAGACGGC ACTGAAGATA TTCCGTTGAA CAATAAACGC
GCGGTAACTA ATCGGTATGG CATCGGAGTT ATTCCATCAG TCAGCAGTTA CATAACAACA
TCATTAAGTG TTGATACCCG AAATCTGCCA GAAAATGTGG ATATCGAAAA CTCGGTTATC
ACCACCACCT TAACCGAGGG TGCTATTGGC TACGCCAAAC TTGATACCCG CAAGGGCTAC
CAAATCATAG GGGTTATTCG CCTGGCAGAT GGTAGTCATC CACCACTGGG GATTAGCGTA
AAAGATGAAA CCAGCCACAA AGAATTAGGA CTGGTTGCTG ATGGCGGCTT TGTATACCTC
AACGGCATTC AGGATGATAA CAAACTTGCT TTACGCTGGG GTGACAAATC TTGTTTTATT
CAACCACCCA ATAGCAGCAA CTTAACCACC GGAACGGCTA TTTTACCGTG TATTAGCCAA
AATTAA
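
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be normalized before any computation. The following is a minimal sketch of that cleanup together with a plain GC-percentage calculation; the helper names are hypothetical, and how the database itself derives or rounds the GC content field is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, as an illustration only.
first_blocks = clean_sequence("ATGTCTGGAA ATATAGGGGC\nAAATCCAGTT")
print(len(first_blocks), round(gc_content(first_blocks), 1))
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length can be compared with the Gene Length field.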
 
Protein sequence
MSGNIGANPV IIIGCASAYA VEFNKDLIEA EDRENVNLSQ FETDGQLPVG KYSLSTLINN 
KRTPIHLDLQ WVLIDNQTAV CVTPEQLTLL GFTDEFIEKT QQNLIDGCYP IEKEKQITTY
LDKGKMQLSI SAPQAWLKYK DANWTPPELW NHGIAGAFLD YNLYASHYAP HQGDNSQNIS
SYGQAGVNLG AWRLRTDYQY DQSFNNGKSQ ATNLDFPRIY LFRPIPAMNA KLTIGQYDTE
SSIFDSFHFS GISLKSDENM LPPDLRGYAP QITGVAQTNA KVTVSQNNRI IYQENVPPGP
FAITNLFNTL QGQLDVKVEE EDGRVTQWQV ASNSIPYLTR KGQIRYTTAM GKPTSVGGDS
LQQPFFWTGE FSWGWLNNVS LYGGSVLTNR DYQSLAAGVG FNLNSLGSLS FDVTRSDAQL
HNQDKETGYS YRANYSKRFE STGSQLTFAG YRFSDKNFVT MNEYINDTNH YTNYQNEKES
YIVTFNQYLE SLRLNTYVSL ARNTYWDASS NVNYSLSLSR DFDIGPLKNV STSLTFSRIN
WEEDNQDQLY LNISIPWGTS RTLSYGMQRN QDNEISHTAS WYDSSDRNNS WSVSASGDND
EFKDMKASLR ASYQHNTENG RLYLSGTSQR DSYYSLNASW NGSFTATRHG AAFHDYSGSA
DSRFMIDADG TEDIPLNNKR AVTNRYGIGV IPSVSSYITT SLSVDTRNLP ENVDIENSVI
TTTLTEGAIG YAKLDTRKGY QIIGVIRLAD GSHPPLGISV KDETSHKELG LVADGGFVYL
NGIQDDNKLA LRWGDKSCFI QPPNSSNLTT GTAILPCISQ N
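
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). As a rough sketch, the translation can be reproduced in pure Python. The codon-to-amino-acid assignments of table 11 are the same as those of the standard code; the tables differ only in which codons may act as start codons. This sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG. `translate` is a hypothetical helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGTCTGGAA ATATAGGGGC AAATCCAGTT"))  # MSGNIGANPV
```

The final six bases of the gene, AATTAA, translate to N followed by a stop codon, which agrees with the last residue of the protein sequence.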