Gene EcDH1_1611 details


Gene Information

Locus tag: EcDH1_1611
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1757898
End bp: 1759376
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: polysaccharide biosynthesis protein
Protein accession: ACX39276
Protein GI: 260448854
COG category:
COG ID:
TIGRFAM ID:
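
The start, end, gene length, and protein length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1757898, 1759376)
print(length_bp)                  # 1479
print(protein_length(length_bp))  # 492
```

Both values agree with the Gene Length and Protein Length fields listed in this record.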


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0000161996
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTAC GTGAAAAAAC CATCAGCGGC GCGAAGTGGT CGGCGATTGC CACGGTGATC 
ATCATCGGCC TCGGGCTGGT GCAGATGACC GTGCTGGCGC GGATTATCGA CAACCACCAG
TTCGGCCTGC TTACCGTGTC GCTGGTGATT ATCGCGCTGG CAGATACGCT TTCTGACTTC
GGTATCGCTA ACTCGATTAT TCAGCGAAAA GAAATCAGTC ACCTTGAACT CACCACGTTG
TACTGGCTGA ACGTCGGGCT GGGGATCGTG GTGTGCGTGG CGGTGTTTTT GTTGAGTGAT
CTCATCGGCG ACGTGCTGAA TAACCCGGAC CTGGCACCGT TGATTAAAAC ATTATCGCTG
GCGTTTGTGG TAATCCCCCA CGGGCAACAG TTCCGCGCGT TGATGCAAAA AGAGCTGGAG
TTCAACAAAA TCGGCATGAT CGAAACCAGC GCGGTGCTGG CGGGCTTCAC TTGTACGGTG
GTTAGCGCCC ATTTCTGGCC GCTGGCGATG ACCGCGATCC TCGGTTATCT GGTCAATAGT
GCGGTGAGAA CGCTGCTGTT TGGCTACTTT GGCCGCAAAA TTTATCGCCC CGGTCTGCAT
TTCTCGCTGG CGTCGGTGGC ACCGAACTTA CGCTTTGGTG CCTGGCTGAC GGCGGACAGC
ATCATCAACT ATCTCAATAC CAACCTTTCA ACGCTCGTGC TGGCGCGTAT TCTCGGCGCG
GGCGTGGCAG GGGGATACAA CCTGGCGTAC AACGTGGCCG TTGTGCCACC GATGAAGCTG
AACCCAATCA TCACCCGCGT GTTGTTTCCG GCATTCGCCA AAATTCAGGA CGATACCGAA
AAGCTGCGTG TTAACTTCTA CAAGCTGCTG TCGGTAGTGG GGATTATCAA CTTTCCGGCG
CTGCTCGGGC TAATGGTGGT GTCGAATAAC TTTGTACCGC TGGTCTTTGG TGAGAAGTGG
AACAGCATTA TTCCGGTGCT GCAATTGCTG TGTGTGGTGG GTCTGCTGCG CTCCGTAGGT
AACCCGATTG GTTCGCTGCT GATGGCGAAA GCGCGGGTCG ATATCAGCTT TAAATTCAAC
GTATTCAAAA CATTTCTGTT TATTCCGGCG ATTGTTATAG GTGGGCAGAT GGCGGGCGCG
ATCGGCGTCA CGCTTGGCTT CCTGCTGGTG CAAATTATCA ACACCATTCT GAGTTACTTC
GTGATGATTA AACCGGTTCT TGGTTCCAGT TATCGCCAGT ACATCCTGAG TTTATGGCTG
CCGTTTTATC TCTCGCTGCC GACGCTGGTG GTCAGTTATG CGCTGGGCAT TGTGCTGAAA
GGGCAACTGG CGCTGGGGAT GCTGCTGGCG GTGCAAATAG CCACGGGGGT GCTGGCGTTT
GTGGTGATGA TTGTGCTGTC GCGCCATCCG CTGGTGGTGG AAGTGAAGCG TCAGTTTTGT
CGCAGCGAAA AAATGAAAAT GCTTTTACGG GCGGGGTGA
 
Protein sequence
MSLREKTISG AKWSAIATVI IIGLGLVQMT VLARIIDNHQ FGLLTVSLVI IALADTLSDF 
GIANSIIQRK EISHLELTTL YWLNVGLGIV VCVAVFLLSD LIGDVLNNPD LAPLIKTLSL
AFVVIPHGQQ FRALMQKELE FNKIGMIETS AVLAGFTCTV VSAHFWPLAM TAILGYLVNS
AVRTLLFGYF GRKIYRPGLH FSLASVAPNL RFGAWLTADS IINYLNTNLS TLVLARILGA
GVAGGYNLAY NVAVVPPMKL NPIITRVLFP AFAKIQDDTE KLRVNFYKLL SVVGIINFPA
LLGLMVVSNN FVPLVFGEKW NSIIPVLQLL CVVGLLRSVG NPIGSLLMAK ARVDISFKFN
VFKTFLFIPA IVIGGQMAGA IGVTLGFLLV QIINTILSYF VMIKPVLGSS YRQYILSLWL
PFYLSLPTLV VSYALGIVLK GQLALGMLLA VQIATGVLAF VVMIVLSRHP LVVEVKRQFC
RSEKMKMLLR AG
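
The sequences above are printed in space-separated blocks of 10 residues, so they need to be cleaned before any length or composition check. The following is a rough sketch of how that could be done; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input. Passing the full gene sequence instead would allow a comparison with the reported Gene Length and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used here only as a short example.
first_line = "ATGAGCTTAC GTGAAAAAAC CATCAGCGGC GCGAAGTGGT CGGCGATTGC CACGGTGATC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```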