Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1387 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 1491250 |
End bp | 1492977 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | ACX39059 |
Protein GI | 260448637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.336718 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAATA AAAATATAAT CATGTTGCTT ATGAGTAGTT TGATTTTGTC AGGATGTGGG CCGCAACCTG AGAATAAGGA AAGTCAGCAA CAACAACCCA GTACTCCCAC AGAGCAGCAA GTGCTTGCCG CGCAGCAAGC TGCAATAAAA GAGGCTGAGC AAAGCGCCGC CGCCGCGAAA GCCTTGGCCC AGCAAGAAGT GCAACAATAT TCAGACAAAC AGGCTTTACA GGGGCGATTG CAGGAAGCGC CAACATTTGC AAGAGCGGCT AAAGCAAAAG CTACACATAT CGCAAATCCA GGAACCGCTC GCTACCAGCA GTTCGATGAT AATCCGGTTA AGCAGGTAGC GCAAAATCCG TTGGCGACGT TTAGTCTTGA CGTTGACACT GGCAGTTATG CGAATGTAAG GCGTTTCCTC AATCAAGGGC TGTTACCTCC GCCAGACGCT GTGCGGGTGG AGGAGATAGT CAATTATTTC CCGTCTGATT GGGATATCAA AGACAAACAA TCTATTCCGG CCTCTAAGCC AATACCTTTC GCTATGCGCT ACGAATTGGC ACCTGCACCA TGGAATGAAC AGCGAACATT GCTGAAAGTT GATATCCTGG CGAAAGATCG CAAAAGTGAA GAGTTACCAG CTTCTAATCT GGTCTTTCTT ATCGACACTT CTGGTTCAAT GATTTCTGAT GAACGTTTGC CACTTATCCA GTCTTCGTTG AAATTATTGG TCAAAGAACT TCGTGAGCAG GATAACATTG CCATCGTGAC CTACGCTGGC GACTCCCGTA TTGCATTGCC TTCTATCTCC GGGAGTCATA AGGCGGAAAT TAATGCCGCA ATTGATTCGC TGGATGCCGA AGGCAGTACC AATGGCGGTG CCGGGCTGGA ACTGGCTTAT CAGCAGGCGA CGAAAGGGTT TATTAAGGGC GGCATCAATC GCATTTTATT AGCCACTGAC GGTGACTTTA ACGTTGGCAT TGACGATCCA AAATCGATTG AATCAATGGT CAAAAAACAG CGGGAGTCTG GTGTTACTCT GTCGACGTTT GGCGTGGGGA ATAGCAATTA CAACGAGGCA ATGATGGTGC GAATTGCCGA TGTTGGTAAC GGCAACTACA GCTACATTGA TACCCTCTCT GAAGCGCAGA AAGTATTGAA TAGTGAAATG CGGCAGATGT TGATTACCGT AGCAAAAGAT GTCAAAGCGC AAATTGAGTT TAACCCCGCG TGGGTAACGG AATACCGTCA GATTGGTTAT GAAAAGCGCC AACTTCGGGT GGAACATTTT AATAACGACA ACGTTGATGC AGGGGATATA GGCGCAGGCA AACATATAAC GTTGTTATTC GAATTAACGC TGAACGGGCA AAAAGCATCA ATTGATAAGT TACGCTATGC CCCGGATAAC AAATTAGCGA AATCGGACAA AACGAAAGAA CTGGCCTGGT TAAAAATTCG CTGGAAATAC CCGCAGGGAA AAGAAAGTCA GTTAGTTGAA TTCCCGCTGG GGCCAACAAT AAACGCGCCC TCTGAAGATA TGCGTTTTCG CGCAGCAGTA GCTGCATATG GGCAAAAGTT ACGCGGTTCT GAATACCTGA ACAATACCTC CTGGCAGCAG ATCAAACAGT GGGCTCAGCA GGCAAAAGGG GAAGATCCAC AGGGTTACAG GGCGGAATTT ATTCGCCTGA TTGAACTGGC GGATGGTGTG ACTGACATCA GTCAGTGA
|
Protein sequence | MRNKNIIMLL MSSLILSGCG PQPENKESQQ QQPSTPTEQQ VLAAQQAAIK EAEQSAAAAK ALAQQEVQQY SDKQALQGRL QEAPTFARAA KAKATHIANP GTARYQQFDD NPVKQVAQNP LATFSLDVDT GSYANVRRFL NQGLLPPPDA VRVEEIVNYF PSDWDIKDKQ SIPASKPIPF AMRYELAPAP WNEQRTLLKV DILAKDRKSE ELPASNLVFL IDTSGSMISD ERLPLIQSSL KLLVKELREQ DNIAIVTYAG DSRIALPSIS GSHKAEINAA IDSLDAEGST NGGAGLELAY QQATKGFIKG GINRILLATD GDFNVGIDDP KSIESMVKKQ RESGVTLSTF GVGNSNYNEA MMVRIADVGN GNYSYIDTLS EAQKVLNSEM RQMLITVAKD VKAQIEFNPA WVTEYRQIGY EKRQLRVEHF NNDNVDAGDI GAGKHITLLF ELTLNGQKAS IDKLRYAPDN KLAKSDKTKE LAWLKIRWKY PQGKESQLVE FPLGPTINAP SEDMRFRAAV AAYGQKLRGS EYLNNTSWQQ IKQWAQQAKG EDPQGYRAEF IRLIELADGV TDISQ
|
| |