Gene EcDH1_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3736 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4026487 
End bp4027566 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content55% 
IMG OID 
Productpermease YjgP/YjgQ family protein 
Protein accessionACX41341 
Protein GI260450919 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00233017 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGGAGA CGCTCAAAAG CCAGCTGGCG ATACTCTTCA TCTTGCTTTT GATCTTCTTC 
TGTCAAAAGT TAGTGAGGAT CCTCGGCGCA GCGGTTGACG GCGATATTCC GGCGAATCTG
GTGCTCTCCC TTCTCGGGTT GGGCGTGCCG GAAATGGCGC AGCTTATCCT GCCATTAAGC
CTGTTCCTCG GGCTGCTGAT GACGCTGGGC AAACTGTATA CCGAAAGTGA AATTACGGTA
ATGCATGCCT GCGGCCTGAG CAAAGCGGTT CTGGTGAAAG CGGCAATGAT CCTTGCGGTA
TTCACGGCAA TCGTAGCGGC GGTTAACGTG ATGTGGGCGG GACCGTGGTC ATCGCGTCAT
CAGGATGAAG TGTTAGCAGA AGCGAAAGCG AACCCTGGCA TGGCGGCGCT GGCGCAAGGG
CAATTCCAGC AAGCGACTAA TGGCAGCTCG GTGCTGTTCA TCGAAAGCGT TGACGGCAGC
GATTTCAAAG ATGTGTTCCT CGCGCAAATT CGACCAAAAG GTAATGCACG TCCTTCTGTG
GTGGTGGCCG ATTCCGGACA TTTAACCCAG CTGCGCGACG GCTCCCAGGT CGTCACTCTC
AACCAGGGAA CGCGCTTCGA AGGCACTGCA ATGTTACGTG ATTTCCGCAT TACGGACTTC
CAGGATTATC AGGCGATCAT TGGTCACCAG GCGGTGGCGC TCGACCCGAA CGATACCGAC
CAGATGGACA TGCGCACATT GTGGAACACT GACACCGATC GTGCTCGCGC AGAACTGAAC
TGGTGTATCA CGTTGGTATT CACCGTGTTT ATGATGGCAC TTATGGTCGT ACCGCTGAGC
GTGGTTAACC CACGTCAGGG ACGCGTACTG TCGATGCTGC CAGCCATGCT GCTGTATCTA
CTTTTCTTCC TGATCCAGAC CTCCCTGAAA TCGAACGGCG GTAAAGGTAA GCTGGACCCG
ACGCTGTGGA TGTGGACCGT TAACCTGATT TATCTGGCTT TAGCGATTGT TCTCAACCTT
TGGGACACCG TGCCGGTCCG CCGCCTGCGC GCCAGTTTTT CGCGTAAAGG AGCGGTGTGA
 
Protein sequence
MRETLKSQLA ILFILLLIFF CQKLVRILGA AVDGDIPANL VLSLLGLGVP EMAQLILPLS 
LFLGLLMTLG KLYTESEITV MHACGLSKAV LVKAAMILAV FTAIVAAVNV MWAGPWSSRH
QDEVLAEAKA NPGMAALAQG QFQQATNGSS VLFIESVDGS DFKDVFLAQI RPKGNARPSV
VVADSGHLTQ LRDGSQVVTL NQGTRFEGTA MLRDFRITDF QDYQAIIGHQ AVALDPNDTD
QMDMRTLWNT DTDRARAELN WCITLVFTVF MMALMVVPLS VVNPRQGRVL SMLPAMLLYL
LFFLIQTSLK SNGGKGKLDP TLWMWTVNLI YLALAIVLNL WDTVPVRRLR ASFSRKGAV