Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3257 |
Symbol | |
ID | 4072592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3858258 |
End bp | 3859505 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985278 |
Product | von Willebrand factor, type A |
Protein accession | YP_592332 |
Protein GI | 94970284 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.382216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.377188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACA TCAAGTACAA AAAGTACGTT CCTAATCTGG CGGAAGAGAT CTCGATGGAA GATCTCATGA AGGCACTGTC GGATTACCTG CTACAGAGCG GGTTCGAGAA TCCGTATTCG GATTTCTACG ATGTGGGCGA CGAGCAGTCG ATGGAGCGGT TGAAACAGGC GATTGCGCAG GCTCTGATGA ACAGCGACCT CTTCGACGAA GAGATGAAGG AACAGCTGAA GCAGGCGCAG GCCGAAGGCG CTTTCGACGA GCTGTTGGAA AAGCTGATGA ACCGGATGGA GCAGGAGAAC TACATCACGG TGGATCAGCC ATTCGATCAA TCGCGGCAGT CGAGCGTGGG TGGGCAGGTT GGCGATGCGC AGGAGCAGGG CGAAGCAAAG TTCGAAGTCA CCGAAAAAGG GCTGGATTTC CTTGGCTATC GAACGCTACG CGACCTGCTG GGATCTCTGG GGAAGTCGAG CTTTGGACGG CATGACACGC GCGATCTGGC GACCGGAGTA GAAGCGAGTG GGTCATCGAA GCAGTATGAG TTTGGCGACA CGCTGAATTT GGACATCACG GCGACGCTGT CGAACGCAAT GCAGCGCGAA GGATTGCAGC TGCCGATTGA GATTGAGTAC AGCGATTTGC AGGTGCACCA GTGCGAGTAT CAGTCGTCTT GCGCGACGGT GCTGATGCTC GATTGCTCGC ACTCGATGAT CCTGTACGGC GAAGATCGTT TTACTCCGGC GAAGAAAGTG GCGATGGCGC TGTCGCAACT GATTCGGACG CAGTATCCCG GCGATAGCTT GTCGTTGGTG CTGTTTCACG ATTCCGCAGA GGAGTTGCCG ATCTCGCAGT TAGCGCGGGT GAAGGTGGGA CCTTACTACA CGAATACGCG CGAAGGGCTG CGGATGGCGC AGCGCATCCT GCAACGCCAG CGCAAGGACA TGAAGCAGAT CATCATGATC ACCGATGGGA AGCCCTCGGC GCTGACGCTG GAAGATGGAC GCATCTACAA AAATGCCTTT GGGCTGGATC CGCTGGTGGT AAGCCAGACG CTGGAAGAAG TCTCGAAGTG CAAGCGCGCG GGAGTGATGA TTAATACCTT CATGCTGGCG AGCGATTATG GGCTGGTGCA GTTCGTGCAG AAGGTGACGC AGATGTGCCG CGGGAAGGCG TACTTCACCA CGCCTTACAC GTTGGGACAG TATCTGCTGA TGGATTACAT GTCGCGGAAG ACGAAGACGG TGCATTGA
|
Protein sequence | MKYIKYKKYV PNLAEEISME DLMKALSDYL LQSGFENPYS DFYDVGDEQS MERLKQAIAQ ALMNSDLFDE EMKEQLKQAQ AEGAFDELLE KLMNRMEQEN YITVDQPFDQ SRQSSVGGQV GDAQEQGEAK FEVTEKGLDF LGYRTLRDLL GSLGKSSFGR HDTRDLATGV EASGSSKQYE FGDTLNLDIT ATLSNAMQRE GLQLPIEIEY SDLQVHQCEY QSSCATVLML DCSHSMILYG EDRFTPAKKV AMALSQLIRT QYPGDSLSLV LFHDSAEELP ISQLARVKVG PYYTNTREGL RMAQRILQRQ RKDMKQIIMI TDGKPSALTL EDGRIYKNAF GLDPLVVSQT LEEVSKCKRA GVMINTFMLA SDYGLVQFVQ KVTQMCRGKA YFTTPYTLGQ YLLMDYMSRK TKTVH
|
| |