Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1866 |
Symbol | |
ID | 4073025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2244516 |
End bp | 2246402 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983875 |
Product | von Willebrand factor, type A |
Protein accession | YP_590941 |
Protein GI | 94968893 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCGAA GCGTTTCCAT TTTATATTCG CTGCTTGTGC TGAGCGGATG GGCTTGTGCT CAGGTGCTGA GCTTTCCCGC GTCCAATCCA GCCGCCATTA CTACGGGCGC GGGGTTGAAC CTTGTCCAAG CCACGGTAAT GGACAGCAAT ATCGCGCTGA TCGATAGCTT GAATCATAGC CCATTAGAAA ACCCTACCAT TGCGCTTTCT AAGCTGGATC TGAAGGCACC GGGCAAAGCG CGTCAGGAGT ATGAAAAGGG GTATCAGGCG CTCGCGAAGA AAGACTTCAC CCAGGCCTTG GGTCATCTCG AAAAAGCGAC TGCGATCTAT CCGAGTTACG TGTCGGCGTT CAATGCGCTC GGAGCAGCTC ATTTAGGTCT GGGTCAAAGC GATGAGGCGC GCGCAGCGTT TGCCGAAGCA ATATCGCTCG ACGACCACCT GCCCAATTCT TATCTGAACA TGGGCTGCGC GGAACTGGCT CTCAAGGATT ACGCCGGCGC AGAGCGAGAC ATAACACAAG CTTCTTCCAT GGCGCCTCTT GATTTTCAGG TGAAAGCAGC CCTCGCATAC AGCCAATATA TGAACAACAA CTATCAAGCT GTGGTTGCCA CGGCGGATGA CGTACATGCC CGCAAACACA GTGGCGCTGC GCTGGTTCAT TTCTACGCCG CAGCTGCCTG GGATGCACAA GGAAACCCGG CGTATGCGCA GCGAGAACTC CGGCTTCTGA TGAAAGAGGA CCCAAAATCA CCAGCGGCGA TCCAAGCGAA AAGCTTGATG CAACAGTTGC AAGATGAAGG CGTTCACTCC AAGAAGACGA CCCGTGTTGA GAGCGGCGAC CTGACTCTCG TCTCGAAAGT CTCTCTCCAA GTACCGTCCG ACGACGAGGA TGCTGAGCAA AAGAAAAAGC AAGATCAGAA AGAGTTAGCA CAGCTCAACG ATGCGGATGC TCTGGACCGT ACCCAACAGG ATGCTGCTGG TGAGGGAGTA GCTTCGGTCG CCACGCCCGA GTCTGCAGGT GGCACGACCG GCTACACATT CCACGCCTCC ACCGATGAGG TGGCGGTCCT CTTTGCTGCG ACCGACCACG GAAGGGCCGT GCCTGATCTC GACGTGAAAG ACATCAAGCT GTTGGATGGC CGCCACGCTC CTGCGCTGGT CACCGGCTTC CGCAATGAGG CTCAGCTCCC CTTACGCATT GGATTGGTGA TTGATACCAG TGCTTCCATC GCGGGCCGAT TCAAGTTCGA GCAGGACGCG GCTGGCGAAT TTCTTCAGAG AGTCCTTACT GGCCCCGAAG ATCTCGGTTT CGTTGTCGGA TTCTCGAACT CCATTCTCAT GGCGCAGGAC TTTACGCACG ATTCGAAGCA AATTGCCCAC AGCATTCAGG CCTTTGCTCC CTCCGGTGGT ACAGCGCTTT GGGATGCAGT GAATTTCGCG GCGGAGAAAC TGGCTAGCCA TCCGGAGAGG CAGCCGGTGG CGAAGATCCT TATTGTCATC AGTGATGGAG AAGACAACTC AAGCGCCACC ACGGCAAAAC AGGCGATCCA ACGCGCCCAG AGTGAAGAGG TGGCGGTTTA CGCAATCAAC ACGCTTGAAA TTACGCAACG TTCGGAGGAG CCTCCGGTCG GCGTGCGCGC TCTGAAAACA CTGGCGGAGA TGACCGGCGG CGCAGCCTTC ACTCCCGGAT CAGTGCGGTG GCTCAACAGC AGCTTGAACG ATCTCCAGCA AGTCATCCGT AGTCGATATC TCATCACATA CAAGCCTTCA GGATTTAAAC GGGACGGCAG CTATCGCCGG GTGCAAGTAG CGGCAGAGAA AGATGGACGT AAACTGCATG TGGTCTCGCG CAGCGGCTAC TACGCGACAG AGAAGCCCGC GAATTGA
|
Protein sequence | MSRSVSILYS LLVLSGWACA QVLSFPASNP AAITTGAGLN LVQATVMDSN IALIDSLNHS PLENPTIALS KLDLKAPGKA RQEYEKGYQA LAKKDFTQAL GHLEKATAIY PSYVSAFNAL GAAHLGLGQS DEARAAFAEA ISLDDHLPNS YLNMGCAELA LKDYAGAERD ITQASSMAPL DFQVKAALAY SQYMNNNYQA VVATADDVHA RKHSGAALVH FYAAAAWDAQ GNPAYAQREL RLLMKEDPKS PAAIQAKSLM QQLQDEGVHS KKTTRVESGD LTLVSKVSLQ VPSDDEDAEQ KKKQDQKELA QLNDADALDR TQQDAAGEGV ASVATPESAG GTTGYTFHAS TDEVAVLFAA TDHGRAVPDL DVKDIKLLDG RHAPALVTGF RNEAQLPLRI GLVIDTSASI AGRFKFEQDA AGEFLQRVLT GPEDLGFVVG FSNSILMAQD FTHDSKQIAH SIQAFAPSGG TALWDAVNFA AEKLASHPER QPVAKILIVI SDGEDNSSAT TAKQAIQRAQ SEEVAVYAIN TLEITQRSEE PPVGVRALKT LAEMTGGAAF TPGSVRWLNS SLNDLQQVIR SRYLITYKPS GFKRDGSYRR VQVAAEKDGR KLHVVSRSGY YATEKPAN
|
| |