Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1063 |
Symbol | |
ID | 8415353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1284889 |
End bp | 1286556 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024026 |
Product | von Willebrand factor type A |
Protein accession | YP_003181423 |
Protein GI | 257790817 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000708626 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACGCTT CCAACCGCAG CCGCGGCAGG CTCGCCGCGG GCTCCATCGC CTTCGCCGTC CTGATCGCCG GCGCATCGCT TGCCGGATGC AGCCCCGACG GTCAGGCGGG CGACCAGCTA GGATCGGCTG CAAGCGAATC GGAGATCATG GCCATCGGTT CCGCGCTGTC CGAGACGGCC TCCACATGCC CTCCTCCCTA TCCGTACGTT CCCTCGCCCT CTCCCGGCGG CACCGAGGAA TACCGTGCAC TCGACGAGCC GGGGTTCCTC TCCCCTGCGA CCAGCCCGCT GTCCACGCTG TCGGCCGACG TCGACACGGC CTCGTACTGC AACCTGCGCC GCATGGTGGC GCAGAGATAC GCGCCGGCCG TCGTGCCCGC CGGCGCCGTA CGCACCGAAG AGCTGCTCAA CTACTTCGAC TACGCCTACC CGGAGCCCGT TGGCTCCGAC TTGTTCGGCG TATCGGCCCA GATGAGCGAC TGTCCTTGGA ACGACCAGAC GAAGCTGCTG GTCATGGGAT TCGCCACCGA GAAGGACGGC GACGCTTCGC CCACGGGCGC CAACCTCGTA TTCCTCATCG ACGTCTCGGG GTCGATGGAC GACCCTGACA AGCTCCCCCT GGTCAAAGAC TCGTTCGCCG CGCTCGTCGA AGGGCTGACG GAGCGCGACC GCGTGTCCGT CGTAACCTAC GCCAGCGGCG AGCGCGTGCT GCTCGAAGGC GTGCCGGGCG ACGACAAGCG GCGTATCATG CGCGCCGTCG ACAGCCTCGT CGCCGAAGGG TCGACGAACG GGGAAGCCGG TTTGGAGCAG GCGTACCGCC TGGCGGAATC GTCGTTCATC GAAGGCGGTG TGAACCGCGT CGTCATGGCG TCGGACGGCG ACCTCAACGT GGGCATCTCG TCCGAGAGCG AGCTGCACGA CTTCGTCGAG CAGAAGCGCG AGACCGGCGT GTACCTCTCG GTGCTGGGAT TCGGCTCGGG CAACTACAAG GACAACAAGA TGGAGACGCT GGCCGACCAC GGCAACGGCG CCTACCACTA CATCGACTGC GCCGAAGAAG CCCGACGGGT GCTCGGCCGG AACCTCCGTG CGAACCTCGT GCCGCTTGCC GACGATGTGA AGATCCAGGT GGAATTCAAT CCTGACCGGG TGAAGGGCTA TCGGCTGATC GGCTACGAGA ACCGCGCGCT CGCCGACGAG GAGTTCCGCG ACGATGCGGG CGAGGTGGGC GCGGGCCATG CGTTCACCGT GGCGTACGAG ATCGTCCCCG CAGGATCGGC GTTCGAGGTG GGCGCGTCCG CATCGAAATA CGGAAGCGAT GCCGACGACC GGCAGGACGG TCGCCGCTCC GAAGCGAACG GCGGAGAATG GCTGACGTGC ACGATGCGCT ACCGCCCTGC GGGAACCGTC GAAGCGGTGG AGCAGGCGCT GGTGGTCGAC GATGAGAGCT GCACCGACGA TCCGAACGGA GATTGGACGT TCGCCGCCGC CGTCATCGAG TGCGGCATGG CGCTGCACCG CTCGCCCCAT GCCGGCGCCG CCACCCTCGA AAGCGCCCGC GACCTGCTGG CAAGCTGCGA GCTCACCGAC CAGCAGCAAG GCTTCGAAAC CCTCCTCGCC GACCTCGCCC GCCAAGAGGG AGCGCACGGG TCATGCAACC GGTACTGA
|
Protein sequence | MHASNRSRGR LAAGSIAFAV LIAGASLAGC SPDGQAGDQL GSAASESEIM AIGSALSETA STCPPPYPYV PSPSPGGTEE YRALDEPGFL SPATSPLSTL SADVDTASYC NLRRMVAQRY APAVVPAGAV RTEELLNYFD YAYPEPVGSD LFGVSAQMSD CPWNDQTKLL VMGFATEKDG DASPTGANLV FLIDVSGSMD DPDKLPLVKD SFAALVEGLT ERDRVSVVTY ASGERVLLEG VPGDDKRRIM RAVDSLVAEG STNGEAGLEQ AYRLAESSFI EGGVNRVVMA SDGDLNVGIS SESELHDFVE QKRETGVYLS VLGFGSGNYK DNKMETLADH GNGAYHYIDC AEEARRVLGR NLRANLVPLA DDVKIQVEFN PDRVKGYRLI GYENRALADE EFRDDAGEVG AGHAFTVAYE IVPAGSAFEV GASASKYGSD ADDRQDGRRS EANGGEWLTC TMRYRPAGTV EAVEQALVVD DESCTDDPNG DWTFAAAVIE CGMALHRSPH AGAATLESAR DLLASCELTD QQQGFETLLA DLARQEGAHG SCNRY
|
| |