Gene EcDH1_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1838 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1988666 
End bp1990417 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content51% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionACX39496 
Protein GI260449074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0025116 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAACGG CATGTATATC ATTTGGGGTT GCGATGACGA CGAACACGCA TTTTAGAGGT 
GAAGAATTGA AGAAGGTTTG GCTTAACCGT TATCCCGCGG ACGTTCCGAC GGAGATCAAC
CCTGACCGTT ATCAATCTCT GGTAGATATG TTTGAGCAGT CGGTCGCGCG CTACGCCGAT
CAACCTGCGT TTGTGAATAT GGGGGAGGTA ATGACCTTCC GCAAGCTGGA AGAACGCAGT
CGCGCGTTTG CCGCTTATTT GCAACAAGGG TTGGGGCTGA AGAAAGGCGA TCGCGTTGCG
TTGATGATGC CTAATTTATT GCAATATCCG GTGGCGCTGT TTGGCATTTT GCGTGCCGGG
ATGATCGTCG TAAACGTTAA CCCGTTGTAT ACCCCGCGTG AGCTTGAGCA TCAGCTTAAC
GATAGCGGCG CATCGGCGAT TGTTATCGTG TCTAACTTTG CTCACACACT GGAAAAAGTG
GTTGATAAAA CCGCCGTTCA GCACGTAATT CTGACCCGTA TGGGCGATCA GCTATCTACG
GCAAAAGGCA CGGTAGTCAA TTTCGTTGTT AAATACATCA AGCGTTTGGT GCCGAAATAC
CATCTGCCAG ATGCCATTTC ATTTCGTAGC GCACTGCATA ACGGCTACCG GATGCAGTAC
GTCAAACCCG AACTGGTGCC GGAAGATTTA GCTTTTCTGC AATACACCGG CGGCACCACT
GGTGTGGCGA AAGGCGCGAT GCTGACTCAC CGCAATATGC TGGCGAACCT GGAACAGGTT
AACGCGACCT ATGGTCCGCT GTTGCATCCG GGCAAAGAGC TGGTGGTGAC GGCGCTGCCG
CTGTATCACA TTTTTGCCCT GACCATTAAC TGCCTGCTGT TTATCGAACT GGGTGGGCAG
AACCTGCTTA TCACTAACCC GCGCGATATT CCAGGGTTGG TAAAAGAGTT AGCGAAATAT
CCGTTTACCG CTATCACGGG CGTTAACACC TTGTTCAATG CGTTGCTGAA CAATAAAGAG
TTCCAGCAGC TGGATTTCTC CAGTCTGCAT CTTTCCGCAG GCGGTGGGAT GCCAGTGCAG
CAAGTGGTGG CAGAGCGTTG GGTGAAACTG ACCGGACAGT ATCTGCTGGA AGGCTATGGC
CTTACCGAGT GTGCGCCGCT GGTCAGCGTT AACCCATATG ATATTGATTA TCATAGTGGT
AGCATCGGTT TGCCGGTGCC GTCGACGGAA GCCAAACTGG TGGATGATGA TGATAATGAA
GTACCACCAG GTCAACCGGG TGAGCTTTGT GTCAAAGGAC CGCAGGTGAT GCTGGGTTAC
TGGCAGCGTC CCGATGCTAC CGATGAAATC ATCAAAAATG GCTGGTTACA CACCGGCGAC
ATCGCGGTAA TGGATGAAGA AGGATTCCTG CGCATTGTCG ATCGTAAAAA AGACATGATT
CTGGTTTCCG GTTTTAACGT CTATCCCAAC GAGATTGAAG ATGTCGTCAT GCAGCATCCT
GGCGTACAGG AAGTCGCGGC TGTTGGCGTA CCTTCCGGCT CCAGTGGTGA AGCGGTGAAA
ATCTTCGTAG TGAAAAAAGA TCCATCGCTT ACCGAAGAGT CACTGGTGAC TTTTTGCCGC
CGTCAGCTCA CGGGATACAA AGTACCGAAG CTGGTGGAGT TTCGTGATGA GTTACCGAAA
TCTAACGTCG GAAAAATTTT GCGACGAGAA TTACGTGACG AAGCGCGCGG CAAAGTGGAC
AATAAAGCCT GA
 
Protein sequence
MLTACISFGV AMTTNTHFRG EELKKVWLNR YPADVPTEIN PDRYQSLVDM FEQSVARYAD 
QPAFVNMGEV MTFRKLEERS RAFAAYLQQG LGLKKGDRVA LMMPNLLQYP VALFGILRAG
MIVVNVNPLY TPRELEHQLN DSGASAIVIV SNFAHTLEKV VDKTAVQHVI LTRMGDQLST
AKGTVVNFVV KYIKRLVPKY HLPDAISFRS ALHNGYRMQY VKPELVPEDL AFLQYTGGTT
GVAKGAMLTH RNMLANLEQV NATYGPLLHP GKELVVTALP LYHIFALTIN CLLFIELGGQ
NLLITNPRDI PGLVKELAKY PFTAITGVNT LFNALLNNKE FQQLDFSSLH LSAGGGMPVQ
QVVAERWVKL TGQYLLEGYG LTECAPLVSV NPYDIDYHSG SIGLPVPSTE AKLVDDDDNE
VPPGQPGELC VKGPQVMLGY WQRPDATDEI IKNGWLHTGD IAVMDEEGFL RIVDRKKDMI
LVSGFNVYPN EIEDVVMQHP GVQEVAAVGV PSGSSGEAVK IFVVKKDPSL TEESLVTFCR
RQLTGYKVPK LVEFRDELPK SNVGKILRRE LRDEARGKVD NKA