Gene Information |
Locus tag | EcDH1_4222 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4584866 |
End bp | 4586317 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | ACX41820 |
Protein GI | 260451398 |
COG category | |
COG ID | |
TIGRFAM ID | |
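The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(4584866, 4586317)
print(length)                     # 1452
print(protein_length_aa(length))  # 483
```

Under those assumptions the record is internally consistent: 4586317 - 4584866 + 1 = 1452 bp, and 1452 / 3 - 1 = 483 aa.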
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA GAGATGATCA TCGCACTGCT GGCCTCACCG CAGCTGGCGG TCTTCTTTGA AAAATTCCCA CGCCTGAAGG CGGCAATCAC TGATGATGTT CCCCGCTGGC GTGAAGCGCT GCGCAGTCGG CTGAAAGATG CCCGAGTCCC GCCGGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC CAGCTCCTCT CCACGCCACA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT CGTCTGAATT CTCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCACGATC ACTTCAGCGT TACACACGCT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTGAG TGAAGTTCAG GAACGCATGA CGCTGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACTGCAGCT GGTCGTCTGT GGGATATGAG CGCCGGTCAG CTTAAACGTG GCGACTATCA GTTGATTGTG AAATACGGTG AATTTCTTAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA GCAGCTGGGG CGTTCTCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC ATGGTGCGCG AACCGGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACGGAACT GGAGTATGAG TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG CGTGAAAAAG TGATCGAACG TCCGGTGGTA CATAAAGATT ACGATGAACA GCCGCGCGGG CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCAG AAAACCGGCG CTGCTATATT ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAA GCAATCCGTT TTTTAAGCCA GCAGTTTCGT GGCGGCACCG ATCTTGCCAG TTGTTTTCGC GCCATTATGG AACGCTTGCA AAGCAGGGAA TGGTTTGATG CCGATGCGGT GGTGATTTCT GATTTTATCG CTCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GCTGCAGCGG GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ACGGCAAACC CGGCATCATG CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC TGGCGGCGAT AA
|
Protein sequence | MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSTI TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTLSGQLE PILADNNTAA GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT MVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI MLFSTEIVRY ELSGPQGIEQ AIRFLSQQFR GGTDLASCFR AIMERLQSRE WFDADAVVIS DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR WRR
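The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own method: the helper names are hypothetical, the codon table is written in the compact NCBI layout (base order T, C, A, G), and only the amino acid assignments of translation table 11 are used. Table 11 also permits alternative start codons, which are not handled here because this gene starts with ATG. Ambiguous bases such as N are not supported.

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above; the trailing partial codon is ignored.
print(translate("ATGCTAACGC TGGATACGCT"))  # MLTLDT
```

To check the whole record, pass the full gene sequence to `translate` and compare the result with the protein sequence after removing its spaces with `clean`. `gc_fraction` on the full gene sequence can be compared with the GC content field in the same way.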
|
| |