Gene EcDH1_1941 details


Gene Information

Locus tag: EcDH1_1941
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2093801
End bp: 2095447
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: AMP-dependent synthetase and ligase
Protein accession: ACX39598
Protein GI: 260449176
COG category:
COG ID:
TIGRFAM ID:
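
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the helper names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2093801
END_BP = 2095447
REPORTED_GENE_LENGTH = 1647     # bp
REPORTED_PROTEIN_LENGTH = 548   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, length == REPORTED_GENE_LENGTH)
print(protein_length(length), protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both reported lengths agree with the coordinates: 1647 bp is 549 codons, one of which is the stop codon.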


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.889751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTGA CATTAACGTT TAACGAACAA CGTCGTGCGG CGTATCGTCA GCAAGGGTTA 
TGGGGCGATG CTTCGCTGGC CGATTACTGG CAGCAGACCG CTCGTGCGAT GCCAGACAAA
ATTGCCGTGG TCGATAATCA TGGTGCATCG TACACCTATA GCGCGCTCGA TCACGCCGCG
AGCTGTCTGG CAAACTGGAT GTTAGCGAAG GGTATTGAAT CAGGCGATCG CATCGCATTT
CAACTGCCTG GCTGGTGTGA ATTTACCGTT ATCTATCTTG CCTGCCTGAA AATCGGTGCA
GTTTCCGTGC CGCTGTTGCC TTCCTGGCGG GAAGCAGAAC TGGTGTGGGT GCTCAATAAG
TGTCAGGCAA AAATGTTCTT TGCACCGACG TTGTTTAAAC AAACGCGTCC GGTAGATTTA
ATCCTGCCGC TGCAAAATCA GCTTCCACAA CTACAACAAA TTGTCGGCGT GGACAAACTG
GCTCCCGCCA CCTCTTCCCT CTCATTAAGT CAGATTATCG CCGACAATAC CTCACTGACC
ACGGCGATAA CGACCCACGG CGATGAATTA GCTGCGGTGC TGTTTACCTC CGGAACCGAG
GGTCTGCCAA AGGGCGTGAT GCTAACGCAT AACAATATTC TCGCCAGTGA GCGGGCTTAT
TGCGCGCGAC TGAATCTGAC CTGGCAGGAT GTCTTTATGA TGCCTGCGCC ACTTGGTCAC
GCAACGGGCT TTCTGCATGG CGTAACGGCA CCATTCTTAA TTGGCGCTCG CAGCGTGTTG
TTAGATATTT TCACTCCTGA TGCGTGTCTC GCGCTGCTTG AGCAGCAGCG TTGCACCTGT
ATGCTCGGCG CAACGCCGTT TGTCTATGAT CTTTTGAATG TACTAGAGAA ACAACCCGCG
GACCTTTCAG CGCTGCGTTT CTTTCTTTGC GGCGGAACCA CAATCCCCAA AAAAGTGGCG
CGTGAATGCC AGCAGCGCGG CATTAAATTA TTAAGTGTTT ATGGTTCCAC AGAAAGTTCG
CCGCATGCGG TGGTGAATCT CGATGATCCT TTGTCGCGCT TTATGCACAC CGATGGTTAC
GCTGCCGCAG GTGTAGAGAT TAAAGTGGTC GATGACGCAC GCAAGACCTT ACCGCCAGGT
TGCGAAGGTG AAGAAGCCTC GCGTGGCCCC AATGTGTTTA TGGGGTATTT TGATGAACCT
GAATTAACCG CCCGTGCCCT GGATGAAGAA GGCTGGTATT ACAGCGGCGA TCTCTGCCGT
ATGGATGAGG CTGGCTATAT AAAAATTACC GGACGCAAAA AAGATATTAT TGTCCGCGGC
GGCGAAAATA TTAGCAGCCG TGAAGTGGAA GATATTTTAT TGCAGCATCC TAAAATTCAC
GATGCCTGTG TGGTTGCAAT GTCCGATGAA CGTTTAGGTG AACGATCATG CGCTTATGTC
GTGCTGAAAG CGCCGCATCA TTCATTATCG CTGGAAGAGG TAGTGGCTTT TTTTAGCCGT
AAACGGGTCG CAAAATATAA ATATCCTGAA CATATCGTGG TAATCGAAAA ACTACCGCGA
ACTACCTCAG GTAAAATACA AAAGTTTTTG TTAAGAAAAG ATATTATGCG GCGTTTAACG
CAGGATGTCT GTGAAGAGAT TGAATAA
 
Protein sequence
MKVTLTFNEQ RRAAYRQQGL WGDASLADYW QQTARAMPDK IAVVDNHGAS YTYSALDHAA 
SCLANWMLAK GIESGDRIAF QLPGWCEFTV IYLACLKIGA VSVPLLPSWR EAELVWVLNK
CQAKMFFAPT LFKQTRPVDL ILPLQNQLPQ LQQIVGVDKL APATSSLSLS QIIADNTSLT
TAITTHGDEL AAVLFTSGTE GLPKGVMLTH NNILASERAY CARLNLTWQD VFMMPAPLGH
ATGFLHGVTA PFLIGARSVL LDIFTPDACL ALLEQQRCTC MLGATPFVYD LLNVLEKQPA
DLSALRFFLC GGTTIPKKVA RECQQRGIKL LSVYGSTESS PHAVVNLDDP LSRFMHTDGY
AAAGVEIKVV DDARKTLPPG CEGEEASRGP NVFMGYFDEP ELTARALDEE GWYYSGDLCR
MDEAGYIKIT GRKKDIIVRG GENISSREVE DILLQHPKIH DACVVAMSDE RLGERSCAYV
VLKAPHHSLS LEEVVAFFSR KRVAKYKYPE HIVVIEKLPR TTSGKIQKFL LRKDIMRRLT
QDVCEEIE
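
The relationship between the two sequences above can be checked by translating the gene sequence and comparing the result with the protein sequence. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. The function names are illustrative, and only the first and last blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with bases in TCAG order; "*" marks a stop codon.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and final 27 bases of the gene sequence above.
print(translate("ATGAAAGTGA CATTAACGTT TAACGAACAA"))  # MKVTLTFNEQ
print(translate("CAGGATGTCT GTGAAGAGAT TGAATAA"))     # QDVCEEIE
```

Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein sequence and the GC content field to be compared in the same way.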