Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2505 |
Symbol | |
ID | 4069874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2961573 |
End bp | 2962865 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984522 |
Product | von Willebrand factor, type A |
Protein accession | YP_591580 |
Protein GI | 94969532 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.622363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.185219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCGC GAAGACTGGT TACTATGCAC CTTCGGAATA GTTCGTTCCC CCTCCGCGCG TTACCGCTCG TACTCCTAGG CGCCCTTGCG TTTGGGCAAA ACACCTCACA GCCGCAAAAT CAGAATTCCC AACCTCCGGC AGGCCAGCAG GGCAAGTCTG ACGGCGGGCT CATCATGCCG ATTGACGATG GTTCGCAGCC AGCGCAGGGG CAGCAGCAGC AGAATTCGGC GCAACCTCAA CCCGGCCAGC AGGGTAAGAC GGACAATGGC CTGGTGATGC CAATCGAGAA CGGCCAGTCC GAGGCGCCGG TTCCGAAGTC GCCGAACCAG CCGGGAGATA CGGTCAACGT TCCCGCGAGC AGTTCGCGCG GCAACGGGCA AAATCCCGAC TCCGAAGTCG GCGGCGTGTA CACGTTCAAG AAGCAGGTTG AAGAAGTTCG CCTGCACGCG ACCGTGGTCG ACGATCGGCA GCGGTTGATC ACGACCCTCG ATAAGACCTC GTTTACGGTT TACGAGAACG GCGAGCCACA GCAGATTACG TCATTCCGGC ACGAAGATAT TCCTGTCGCG CTGGGCGTTG TGATCGATAA CTCCGGCTCA ATGAGGGACA AGCGTCCGGC AGTGAATGCG GCGACGATCA ACCTCGTGAA AGCCAGCAAT CCAGAGGACG AGGTGTTCGT CGTGAACTTC AACGACGATT ATTATCTCGA CCAGGACTAC ACCGACAGCG TTGCGAAACT GAAAGAGGCA CTGGAGAAGT ACGAGACCCG TGGTGGCACG GCGTTGTACG ACGCGGTGCT GGCCTCGAAC GCGCACTTGA TGAAGGCTCC GAAGCTGGAG AAGAAGGTTC TGTTCATCGT TACGGACGGT GAAGACGATG CCAGCCTCAA TACGTTGGAG CAGACGATCC GCAAGGTGCA GCAGGAGAAC GGGCCGACGA TTTACACCAT CGGAATTCTG GATGAAACCG GTGGGCATAA GCGTCGCGCG CAACGTGCAC TTCGTGAGAT GGCGGAATCC ACCGGTGGCG TGGCGTTCTT CCCGCAGAGC CTCGACGAAG TGAGCCGGAT CACGCAGCAG ATCGCGCACG ATATCCGCAA CCAGTACACG ATTTCGTACA AGCCGACGAA TCCACAAGCG CGTGGTGGCT ATCGCCAGGT GAAGGTAGAG GCGAAGTCGA AGGGCTTCAA GGCCCTGCAG GTTCGGACGC GCGCGGGCTA TTACGCGGGA CAGACGCAGT CCGCGGCCAA TCCGCCGGTG CGTAAGCCGG AGACCAACGC GGCTGTGCGA TAA
|
Protein sequence | MFSRRLVTMH LRNSSFPLRA LPLVLLGALA FGQNTSQPQN QNSQPPAGQQ GKSDGGLIMP IDDGSQPAQG QQQQNSAQPQ PGQQGKTDNG LVMPIENGQS EAPVPKSPNQ PGDTVNVPAS SSRGNGQNPD SEVGGVYTFK KQVEEVRLHA TVVDDRQRLI TTLDKTSFTV YENGEPQQIT SFRHEDIPVA LGVVIDNSGS MRDKRPAVNA ATINLVKASN PEDEVFVVNF NDDYYLDQDY TDSVAKLKEA LEKYETRGGT ALYDAVLASN AHLMKAPKLE KKVLFIVTDG EDDASLNTLE QTIRKVQQEN GPTIYTIGIL DETGGHKRRA QRALREMAES TGGVAFFPQS LDEVSRITQQ IAHDIRNQYT ISYKPTNPQA RGGYRQVKVE AKSKGFKALQ VRTRAGYYAG QTQSAANPPV RKPETNAAVR
|
| |