Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1366 |
Symbol | |
ID | 4068842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1657456 |
End bp | 1658931 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983375 |
Product | von Willebrand factor, type A |
Protein accession | YP_590442 |
Protein GI | 94968394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.808281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACAG TTTCGAAATT GCGCAACTCT GTATTGGTTT TTGCCGCGTG CATGACTGCA TCCGGAGTCT CCCTCGCGCA GAACGAACTC GATCTCTATA CCACTGCCGT GCGCCAAAGC CGCATCTCCG ACCGGAGTGC GTGGATGGCG CGGTTTCTAA AGGAACATCC GCAGAGTGAT TTGCGCGAGG ATGCGCTTGA AGTCCTGGTA TGGGACGCGA TGGAAAGCGG CCAGCGTGAC CAGTCGCGCC AATATGCGCA AGAGTTGCGG CAGATTGATC CGCACAATGC GCTGGCGATG GCGGTCGTGG CTGAAACGCG CGCAGAGACC ACCGGTCGCG CCGACAAGAA AGCGGCGGCA CAGGCTTTTG AAATAGCGAA GGCCGGGATC CAGGTGTATC CGCAAATGCA TCGGCCGGAA GGCATGCGTG AGGGCGAGTT CATCCTGCTA CAGCGGCAGG TGGTCGCGGT GCTCGATGGC GAAGCAGGCC TTGGATATCT CGCGGATAAA GATTACGAAG CGGCGCGCCG ATATCTGCAC GAGGCGGTTG CCATTCGTCC ACAGGACCCG CGTTATCTTT ACGGACTTTC GTTGGCGCTG CTCGACGGGA AAGACGCGAT GCAGCAGGAG GGCTATCTCT ATCTCGCGCG CACAGTGAAC CTGACGCAAG GAACGCCGGC GGGACAGCAG ATCGCAAATT ACGCGCAGAA ACGCTTCGAA AAACAGGGCG GTACGACTGC ATCGTGGAAC GAGTATCTTG CCGCGGCGAC GACGCCGGGC ATGCCGCGAC GTGCGCCAGC GACGCAGCCT GAGGCCCCAA TAGTCGCGAA GAACGTGCCG CCAACGCGGC CGGGAGTGCA GCCGCAATCC GAACCGCGTG AGACGAACCC TGAGGAGATT CCGCAGCCGA CGTTTAAGCG TGAGTATGTG GCGCGCACGT CTCCCGTCTC GATGGGCATT TTGATTCAGA CAGAACACCT GACGAAGGAG AACCGCCGAC AGATCCTCGA CGCGCTTACC GACATGATCC GGCACCTGCG CAACGACGAT GAAGTGTTCA TCATGGCGTA TGGCAAGAGC CTGCAATTCG AGCAGGACCT CACCGGTAAT CCGAAGCTTC TGGAAGAAGC GATGGAGCAG ATCAAAGCGG AGAGCGGCAC TGCCCTGCTC GATGCCGTGG GCTTTGCTGC GGGACACCTG GAGCGCATTG CGACCAACAA GAACCGGCTG TTGCTGGTGA TTTCGGATGG GCGGAATACG CCGTCGAAGG ACAATCCGCT TACGCTCTCG CAGAAACTGA ATACGGTGCG GGTGGATTGC ATTGGGCTTG ATGTGGATGG CGATTCGGGG CGGCGTCAGT TGGAGTCGCT GGCGGCGTAT TCAGGCGGGC AGGTGAGTTT TGCGAGCGAC ACGCGGCAAC TGCGCACGGC GGCGGTGCAG ATGGCGGAAG CGATTGGGAT TGAGTTTCCG GATTAG
|
Protein sequence | MSTVSKLRNS VLVFAACMTA SGVSLAQNEL DLYTTAVRQS RISDRSAWMA RFLKEHPQSD LREDALEVLV WDAMESGQRD QSRQYAQELR QIDPHNALAM AVVAETRAET TGRADKKAAA QAFEIAKAGI QVYPQMHRPE GMREGEFILL QRQVVAVLDG EAGLGYLADK DYEAARRYLH EAVAIRPQDP RYLYGLSLAL LDGKDAMQQE GYLYLARTVN LTQGTPAGQQ IANYAQKRFE KQGGTTASWN EYLAAATTPG MPRRAPATQP EAPIVAKNVP PTRPGVQPQS EPRETNPEEI PQPTFKREYV ARTSPVSMGI LIQTEHLTKE NRRQILDALT DMIRHLRNDD EVFIMAYGKS LQFEQDLTGN PKLLEEAMEQ IKAESGTALL DAVGFAAGHL ERIATNKNRL LLVISDGRNT PSKDNPLTLS QKLNTVRVDC IGLDVDGDSG RRQLESLAAY SGGQVSFASD TRQLRTAAVQ MAEAIGIEFP D
|
| |