Gene Information |
Locus tag | Acid345_3008 |
Symbol | |
ID | 4071563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3566923 |
End bp | 3567948 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985027 |
Product | von Willebrand factor, type A |
Protein accession | YP_592083 |
Protein GI | 94970035 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
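
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. This is a minimal sketch in Python. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3566923, 3567948)
print(length, protein_length(length))  # prints: 1026 341
```

Both results match the Gene Length (1026 bp) and Protein Length (341 aa) fields.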
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.780877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000171728 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTCCAT TAAAAATGCG TTCCAGGGTC ATCGCAGTGT CGCTGGCGTT CGTGTGCGTT GTTGCCACCG GGGCGCAGGA GCCGGTGTTT CGCGCGCAAT CGAACGTGGT GATTGTGCCG ACGCTGGTGC GCGATGCTGA GCGCAATGCT GTCTATGGAC TGGCGGAGAA AGACTTCATC ATTGAAGACG ATGGGGTGCC GCAGACGGTG CACCTGGATG AAGCGTCAGA AGAGCAGCCG GTCTCGATTG TCGTAGTTCT GCAGCTTGGA CGCCGCGCCG ATTACGAGTT GCCGCGCGTC AAAGGATTGC GTTCCATGCT GTCGCCGCTG ATGGATGCGG GACACGCCCG GGTCGCGATG GTGACTTTCG ATCAGGATGC AACGCTTTTC CAGGACTTCA CGAGTGATTC CAACGTTTTC GAAAAACGGC TCGGCCAGTT GGAACCGGGA AACGGCGGAG CCGCGATTGT GGATGCGGTG CACTTCGGCG TGAAGTTGCT GAATGCGACG CCTAAAAATA GCCAGCGCGT GCTTCTGCTG ATCGGCGAGA CGCGCGACCA CGGCAGCACG AAGAAGCCTG AGGACCTGCT ACGCGAATTG GGAACGAGCA ACATCGTGAT CTACGCGCTC ACGTTCTCGC CGTCGAAGTC GAATGTGCTC GACACGCTGC GGGGGACAAA CAATCCCGAC CTGCATCCGG AGCAGAGCGA AGTGCACGAG GGTCCGGACC TGCTGGCGCC GCTGATACTG GCGGCCGAAG GCATGCGCAA GAACGCAGCG AAGACGATTA CGGCAATGAC CGGAGGAGAG TACACGCAGT TCGCGACGAG CAAGAACTTC GACCGCGATA TGAACGCGTT CTCGAACCAC CTGTACTCGC GTTACGTGTT GAGCTTCGCG CCAAGCAAGC CGCATCCGGG GTTGCATCAC CTGACCGTGA GACTGAGAGA CCCCGGTAAA GCCGCTGTGT TAGCGAGGGA GAGTTACTGG GCGGAAGGCG CGACCAGCGG AAATCCCCAA CCTTAG
|
Protein sequence | MSPLKMRSRV IAVSLAFVCV VATGAQEPVF RAQSNVVIVP TLVRDAERNA VYGLAEKDFI IEDDGVPQTV HLDEASEEQP VSIVVVLQLG RRADYELPRV KGLRSMLSPL MDAGHARVAM VTFDQDATLF QDFTSDSNVF EKRLGQLEPG NGGAAIVDAV HFGVKLLNAT PKNSQRVLLL IGETRDHGST KKPEDLLREL GTSNIVIYAL TFSPSKSNVL DTLRGTNNPD LHPEQSEVHE GPDLLAPLIL AAEGMRKNAA KTITAMTGGE YTQFATSKNF DRDMNAFSNH LYSRYVLSFA PSKPHPGLHH LTVRLRDPGK AAVLARESYW AEGATSGNPQ P
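
The sequences above are shown in space-separated blocks of ten characters, so the spaces need to be removed before any downstream use. The sketch below, in Python, shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the example string holds only the first three blocks of the gene sequence, so its GC fraction will differ from the value reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from this record (illustration only).
fragment = clean_sequence("ATGAGTCCAT TAAAAATGCG TTCCAGGGTC")
print(len(fragment), round(gc_fraction(fragment), 3))
```

Running the same two functions on the full gene sequence should give a length of 1026 and a GC fraction that can be compared with the 59% listed in the record.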