Gene EcDH1_0665 details

Gene Information

Locus tag: EcDH1_0665
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 704167
End bp: 705648
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: type I secretion outer membrane protein, TolC family
Protein accession: ACX38351
Protein GI: 260447929
COG category:
COG ID:
TIGRFAM ID:
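
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(704167, 705648)
print(length, protein_length_aa(length))  # prints: 1482 493
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields.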


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGGTTCAG TTCGTTGAGC 
CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT
AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA
CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC
GGCATCAACT CTAACGCGAC CAGTGCGTCC TTGCAGTTAA CTCAATCCAT TTTTGATATG
TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCAG GGATTCAGGA CGTCACGTAT
CAGACCGATC AGCAAACCTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT
GCTATTGACG TTCTTTCCTA TACACAGGCA CAAAAAGAAG CGATCTACCG TCAATTAGAT
CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CCGACGTGCA GAACGCCCGC
GCACAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG
GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCTGC GCTGAATGTC
GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACGCGC TGCTGAAAGA AGCCGAAAAA
CGCAACCTGT CGCTGTTACA GGCACGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC
CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTACCGG GATTTCTGAC
ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT
ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT
AACTCGCAGG TGAAACAGGC ACAGTACAAC TTTGTCGGTG CCAGCGAGCA ACTGGAAAGT
GCCCATCGTA GCGTCGTGCA GACCGTGCGT TCCTCCTTCA ACAACATTAA TGCATCTATC
AGTAGCATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG
GAAGCGGGCT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG
TTGTACAACG CCAAGCAAGA GCTGGCGAAT GCGCGTTATA ACTACCTGAT TAATCAGCTG
AATATTAAAT CAGCTCTGGG TACGTTGAAC GAGCAGGATC TGCTGGCACT GAACAATGCG
CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCAC CGCAAACGCC GGAACAGAAT
GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCAG TCGTTCAGCA AACATCCGCA
CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
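
The GC content field can in principle be recomputed from the gene sequence. The record does not say how its 51% figure was derived or rounded, so the following is only a minimal sketch of the usual definition (G plus C bases as a percentage of all bases). `gc_percent` is a hypothetical helper name; it strips the spaces that separate the 10-base blocks.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used here only as a small input.
print(gc_percent("ATGAAGAAAT"))
```

Passing the full gene sequence as one string gives a value that can be compared with the reported GC content.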
 
Protein sequence
MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL 
LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY
QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR
AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNALLKEAEK
RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASTGISD TSYSGSKTRG AAGTQYDDSN
MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI
SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL
NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA
RTTTSNGHNP FRN
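
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below builds the codon table from the amino-acid string that NCBI publishes for table 11 (codons ordered T, C, A, G) and translates up to the first stop codon. `translate` is an illustrative helper, not the database's own tool. It does not handle ambiguous bases or the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Amino acids for NCBI translation table 11, codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()  # drop block-separating whitespace
    if len(bases) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAGAAAT TGCTCCCCAT TCTTATCGGC"))  # prints: MKKLLPILIG
```

The output matches the first ten residues of the protein sequence, and the final four codons of the gene (TTC CGT AAC TGA) translate to FRN followed by a stop.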