Gene EcDH1_1387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1387 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1491250 
End bp1492977 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content47% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionACX39059 
Protein GI260448637 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.336718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAATA AAAATATAAT CATGTTGCTT ATGAGTAGTT TGATTTTGTC AGGATGTGGG 
CCGCAACCTG AGAATAAGGA AAGTCAGCAA CAACAACCCA GTACTCCCAC AGAGCAGCAA
GTGCTTGCCG CGCAGCAAGC TGCAATAAAA GAGGCTGAGC AAAGCGCCGC CGCCGCGAAA
GCCTTGGCCC AGCAAGAAGT GCAACAATAT TCAGACAAAC AGGCTTTACA GGGGCGATTG
CAGGAAGCGC CAACATTTGC AAGAGCGGCT AAAGCAAAAG CTACACATAT CGCAAATCCA
GGAACCGCTC GCTACCAGCA GTTCGATGAT AATCCGGTTA AGCAGGTAGC GCAAAATCCG
TTGGCGACGT TTAGTCTTGA CGTTGACACT GGCAGTTATG CGAATGTAAG GCGTTTCCTC
AATCAAGGGC TGTTACCTCC GCCAGACGCT GTGCGGGTGG AGGAGATAGT CAATTATTTC
CCGTCTGATT GGGATATCAA AGACAAACAA TCTATTCCGG CCTCTAAGCC AATACCTTTC
GCTATGCGCT ACGAATTGGC ACCTGCACCA TGGAATGAAC AGCGAACATT GCTGAAAGTT
GATATCCTGG CGAAAGATCG CAAAAGTGAA GAGTTACCAG CTTCTAATCT GGTCTTTCTT
ATCGACACTT CTGGTTCAAT GATTTCTGAT GAACGTTTGC CACTTATCCA GTCTTCGTTG
AAATTATTGG TCAAAGAACT TCGTGAGCAG GATAACATTG CCATCGTGAC CTACGCTGGC
GACTCCCGTA TTGCATTGCC TTCTATCTCC GGGAGTCATA AGGCGGAAAT TAATGCCGCA
ATTGATTCGC TGGATGCCGA AGGCAGTACC AATGGCGGTG CCGGGCTGGA ACTGGCTTAT
CAGCAGGCGA CGAAAGGGTT TATTAAGGGC GGCATCAATC GCATTTTATT AGCCACTGAC
GGTGACTTTA ACGTTGGCAT TGACGATCCA AAATCGATTG AATCAATGGT CAAAAAACAG
CGGGAGTCTG GTGTTACTCT GTCGACGTTT GGCGTGGGGA ATAGCAATTA CAACGAGGCA
ATGATGGTGC GAATTGCCGA TGTTGGTAAC GGCAACTACA GCTACATTGA TACCCTCTCT
GAAGCGCAGA AAGTATTGAA TAGTGAAATG CGGCAGATGT TGATTACCGT AGCAAAAGAT
GTCAAAGCGC AAATTGAGTT TAACCCCGCG TGGGTAACGG AATACCGTCA GATTGGTTAT
GAAAAGCGCC AACTTCGGGT GGAACATTTT AATAACGACA ACGTTGATGC AGGGGATATA
GGCGCAGGCA AACATATAAC GTTGTTATTC GAATTAACGC TGAACGGGCA AAAAGCATCA
ATTGATAAGT TACGCTATGC CCCGGATAAC AAATTAGCGA AATCGGACAA AACGAAAGAA
CTGGCCTGGT TAAAAATTCG CTGGAAATAC CCGCAGGGAA AAGAAAGTCA GTTAGTTGAA
TTCCCGCTGG GGCCAACAAT AAACGCGCCC TCTGAAGATA TGCGTTTTCG CGCAGCAGTA
GCTGCATATG GGCAAAAGTT ACGCGGTTCT GAATACCTGA ACAATACCTC CTGGCAGCAG
ATCAAACAGT GGGCTCAGCA GGCAAAAGGG GAAGATCCAC AGGGTTACAG GGCGGAATTT
ATTCGCCTGA TTGAACTGGC GGATGGTGTG ACTGACATCA GTCAGTGA
 
Protein sequence
MRNKNIIMLL MSSLILSGCG PQPENKESQQ QQPSTPTEQQ VLAAQQAAIK EAEQSAAAAK 
ALAQQEVQQY SDKQALQGRL QEAPTFARAA KAKATHIANP GTARYQQFDD NPVKQVAQNP
LATFSLDVDT GSYANVRRFL NQGLLPPPDA VRVEEIVNYF PSDWDIKDKQ SIPASKPIPF
AMRYELAPAP WNEQRTLLKV DILAKDRKSE ELPASNLVFL IDTSGSMISD ERLPLIQSSL
KLLVKELREQ DNIAIVTYAG DSRIALPSIS GSHKAEINAA IDSLDAEGST NGGAGLELAY
QQATKGFIKG GINRILLATD GDFNVGIDDP KSIESMVKKQ RESGVTLSTF GVGNSNYNEA
MMVRIADVGN GNYSYIDTLS EAQKVLNSEM RQMLITVAKD VKAQIEFNPA WVTEYRQIGY
EKRQLRVEHF NNDNVDAGDI GAGKHITLLF ELTLNGQKAS IDKLRYAPDN KLAKSDKTKE
LAWLKIRWKY PQGKESQLVE FPLGPTINAP SEDMRFRAAV AAYGQKLRGS EYLNNTSWQQ
IKQWAQQAKG EDPQGYRAEF IRLIELADGV TDISQ