Gene Information |
Locus tag | Caci_4775 |
Symbol | |
ID | 8336129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5437065 |
End bp | 5438912 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957875 |
Product | von Willebrand factor type A |
Protein accession | YP_003115477 |
Protein GI | 256393913 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
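The coordinate and length fields above agree with each other, and that can be checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(5437065, 5438912)
print(length)                  # 1848, matching "Gene Length"
print(protein_length(length))  # 615, matching "Protein Length"
```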
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0165447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGGCA GGCATCGGTC CTATGAGGGC CCCGGCAACA CCCCCCGCGG CCGCGGCGGC GGCTCCGGCG GATCGTCGTT CCCGACCGGG TTGGTCGCGA TCGGCGCGGT CCTCGTGCTC GCCGCCGGCG GCGGCTACGT GTATTACACG AAAAAACACG ACACCACGGC GACCGCGGGC AACAGCGGCA CGTCCTCGGC GGCCAACGGC TCGTGCGCGG CGCCCACCAC GCTGAACGTC GACGCCAACC CCGACGTCTA CACCGCGGTC AAGGCGGTCG CCGACGGCAT GGCGGACCCG TGCGTGCACG TCAACGTCAG CAGCGCCGAG GCCTCGGCGG TCGAGGCGTT CCTGGCCGGC AGCGCCAAGG GCGGGGACGT CACCAGCGCT CCGGACGTGT GGATCCCGGA CAGCAGCATG TGGATCGACA TCGCGCACAC CGGTGGCGTG AAGTCGCTGG CCGCCAACCC CGCGCCGGTG GCGACCAGCC CGCTGGTGAT CGGGATGCCC AAGCCGGTGG CCGCAGCCGC CGGCTGGCCG GCCAAGCCCT TCGGCTGGGC GGACCTGCTG GCCAACTTCA AGACCACCAA GCTGCAGACC GCCGTTCCGG ACCCGACCAC CTCAGGGCCC GGACTCGCGG CGATCACCAT GCTGCGCGCG GCCGTGCTCG GCCCGGCCGG GACCGACAAG GCCAAGCAGA GCCAGGCGCT GCAGAACCTC ACGCTGGTCT ACCGGGTCAT GAGCACCTCG GTCTCCAGCT CGATGAGCGC CCTGCTCACC GGGCTGCCGA CGCAGGGTGC CACCGCGGCC GGAGCCGGCG GTATAGCGGC GTTCCCGTCC ACCGAGCAGA AGATCGCGGC GTACAACACG GCCAGTCCCG CCACGCCGCT CGTCGCGCTG TATCCCTCGG ATATGGGCAC GATGATGATG GACTACCCGT ACACCATCAG CTCCACCCTG GACGCCGCGC ACGCCAAGGC CGCAGCGGAC TTCCAGACGC TGTTGCACAG CCCTGCGGCC GTCAACACCC TGCAGAAGGC CGGCTTCCGC GATCCCAAGG GCGCCGCCGC AGGAATCCTC ACCTCCGCGA ACGGCGTCAA CCCGGCGGTA CCGGCGCTGG CTCCGGCGGA CACCACCCAC ACCGCCGCCG GCTCGGCGCT GTCGGTGTGG AAGGTGACCA GCGAGCAGAC CCGCGGCCTG GTGGTCATGG ACGTCTCCGG CTCGATGGGC CTGACCGTGG ACGGGCAGGT CGACCCGAAT ACCCACACCC CGCTGAGCCG GCTGCAGATC ACCGCCGCGG CGTGCCTGAC CGGGCTGCCG CTGTTCGGCG ACAGCTCGCA GCTGGGCCTG TGGACGTTCA CCACCAAGAA CACCGCGGAC GGCGGCGGGA CCGTGCACAA GGAGCTGGTC CCGATGGGCC CGCTGTCGGC ACCGGTGGGC GCCTTCCCCA GCCGGCGCGC GGCGCTGAAC GCGGCGCTGG GACAGCTGAG CATCCAGCCG GGCAGCCGCA ACGGGCTCTA CGACACCATC CTGGACGCCT ACCAGACGGT GCTGACCGGC TGGGCGCCGA ACGAGTCCAA CGCGATCGTG GTCTTCACCG ACGGCAAGGA CGACGGCCTG AACTCGATGA GCGCCGACCA GCTGATCACC AAGCTGAACG CGCTCAAGGC CGCGAACCCG AACCACCCGG TCCGGGTCAT GATCGTGGCC CTGGGCAGCG GCGTGGACCT CACCACCCTG TCGAAGATCA CCGGCGCCGC CAACGGCCAG GCGCTGCACG CCGACACCCC CGCCGACATC 
GGCTCGGCGG TGATCGCCGG CTTCGCGGGC CGCCTGTCCG ACCAGTGA
|
Protein sequence | MAGRHRSYEG PGNTPRGRGG GSGGSSFPTG LVAIGAVLVL AAGGGYVYYT KKHDTTATAG NSGTSSAANG SCAAPTTLNV DANPDVYTAV KAVADGMADP CVHVNVSSAE ASAVEAFLAG SAKGGDVTSA PDVWIPDSSM WIDIAHTGGV KSLAANPAPV ATSPLVIGMP KPVAAAAGWP AKPFGWADLL ANFKTTKLQT AVPDPTTSGP GLAAITMLRA AVLGPAGTDK AKQSQALQNL TLVYRVMSTS VSSSMSALLT GLPTQGATAA GAGGIAAFPS TEQKIAAYNT ASPATPLVAL YPSDMGTMMM DYPYTISSTL DAAHAKAAAD FQTLLHSPAA VNTLQKAGFR DPKGAAAGIL TSANGVNPAV PALAPADTTH TAAGSALSVW KVTSEQTRGL VVMDVSGSMG LTVDGQVDPN THTPLSRLQI TAAACLTGLP LFGDSSQLGL WTFTTKNTAD GGGTVHKELV PMGPLSAPVG AFPSRRAALN AALGQLSIQP GSRNGLYDTI LDAYQTVLTG WAPNESNAIV VFTDGKDDGL NSMSADQLIT KLNALKAANP NHPVRVMIVA LGSGVDLTTL SKITGAANGQ ALHADTPADI GSAVIAGFAG RLSDQ
|
| |
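The gene sequence above is printed in space-separated blocks of ten bases. It begins with GTG, which translation table 11 accepts as an initiation codon (read as Met, consistent with the leading M of the protein), and ends with the stop codon TGA. A minimal sketch of how such a record could be cleaned and sanity-checked follows. The helper names are hypothetical, and the start-codon set is a deliberately small subset of those allowed by table 11.

```python
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumed subset of table 11 start codons
STOPS = {"TAA", "TAG", "TGA"}          # stop codons in table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOPS
    )
```

Passing the full "Gene sequence" field through `clean_sequence` and then `gc_percent` gives a value that can be compared with the reported GC content.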