Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3662 |
Symbol | |
ID | 8335015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4095803 |
End bp | 4101928 |
Gene Length | 6126 bp |
Protein Length | 2041 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956802 |
Product | von Willebrand factor type D protein |
Protein accession | YP_003114405 |
Protein GI | 256392841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.855021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGCCCA ACCTCGTCCT GCCGCCCTGT TCCCCCAACA GCACCGACAC CTCGATCCCG GACCCGGGCG GCCCGCGCTG CATCGCCGGG CTCTCGATGT CCACGCGGCG GACCCTGGGC TACGTGTACA ACCCGGCGAC CGGGAGCTAC ACCGTGCCCT GCATCAGCGA CCCGCGCGCC CTGATACCGG ACGCGACCTA CAGCAAGATG GACTGCAAGC CGTTGTATCT GGCGGGCGAG AACATCGCGC AGACGCGGGT GATCGCGCAG CTGAACGCGG CCGGCGTCTA CGGCGACAGC GCGGCGACGG GCATCACGGC CAACCTGCAG TGGGAGACGA TCCTGCCCGA CCCCGCGGCG CCGGACAAGA ACAACCGCCC GGACCTTCTC CTGTACGACC GCACCAAGGC GAACGGCCCG GTCGGGCTGG TGGAGATGAA GGGCAACTGG AACACCAAAG ACGACCCGGT GGCGGAGGTC AAGAAGTACG TCCAGGACTG GCCGGTGAGC AATCACCCGG CTGTGGAATA CCACTTCACG ACGCCGATCA GCGACGACTT CAAGATCCAG CTGGAACCGC CGTGCAAGGA CGACCCGACC AAGCACTCCT ACTACCAGTT CCACACCTAC AGCGACCCGG CCAATGACGG CGTGATCCGG ATCTCCCGGA CCTTCGTGGG CTGCCCGCCG CGCCAGAAGG GCAGCGGCCA GGACGAGCAG AACGGCGGCT ACCAGGGGAC CATCGGCGCC GACGCCGACC ACAACGGCGT CGACGACATC TGGGACTTCG TCAGGAACCA CCCGGAGCTG TGGTACCTGA CCACTCCGAT GCTCATCCCC ACCTTCCACC CCTTGTCCAA GCCGGTCATG GTCTCGCTGG ACCGCGACGC GCTGACGGAG CTGGAGGAGG CCACCGAGGA CGCCGCCCCG GACGCCATCG ACGCGGAATG GGCGGCGCTG GTCGACAGCA TGCCCGACGC GGTCGCCGAG GCGGCCGCCG ACATCGCCGC CGGCGACGCG ACCGCGCTGG GGTTGGACGC CACCGAGATG GCTCTGGCGG AGCTGGCGCT GGACGGCGGG TTCGTCACCC TCAGCGCGTT GTTCGTGGCA CCGTTGATCG CCGCGGCCGT CCTGGCGGCG ATCTGGGCGG TGATGCACTG GCACCTGTTC GGCGATCCGC ACATGACGAC CCTGGACGGA CTGGCCTACG ACCTCCAGGA CCAAGGGGAG TTCGAGGTCG TCCACGTCCC CAGCCTGAAC CTCGACGTCC AAGCCAGATT CCTGCCTTTG GGCGGTTCCA GAACGTTCAC CGTGGTGGAC TCGGTGGTCT TCACGATCAA CGGGTCCCTC GTCGAGCTGA ACCAGCATGG GGTCGCGACC GTCAACAGCG TGCCGATCCC CTCGTCGCAG ACGCTGACCG AGTTCGGCGA CGGTGCGGCG CTGGTCCGCA ACGGCAACAA GTTCGTGGCG ACCTTCGGCG CGGGGAACGC CCGCCTTGCC TTCGGTGACA GCAGCCTGGG CTTCGACATC GCCCCGGGCA TACCGACCAC CGGACTGCTC GGCAACAACG ACGGCATCCC GGGCAACGAC CTGGTCATGG CCGACGGCAC GCCCCTGACC TCGCCGACCG CCGCCACGAT CGACGGCACG TACGCGAACT CCTGGCGGCT CACGGACAAC GAGTCCGACT TCACCTACGA CGAAGACATG GACACCGCGG CGTACACCGA CCTCACGTTC CCCTCGAACG TGGTGACGCT GGCCGACTTC GCACAGTCCG ACCAGGACCT CGCGCGCCAG GCGTGCAACG CCCAAGCGGT CCCCCCGGGT CCGCAGTTCG ACGCCTGCAT GCTCGACGTC GCCCAGACCG GCGACGCGAA CTACGCCAAG GCCGCCGCGG CGGTGACCGA CGTCCTTCAG GACTACTCCG CGCACACCGT GGACGCCAGC GGCACCGTGA CCGAGAACTT CGAAGGCGCG GTCGGCAGCA ACTTCCGCCC CGACAGCACC GAGTCGATCG GCGGCACGAC CGCGGCCGGC CCGGTGTTCG ACGGCAGCGG TTACAGCTTC AGCGTCCCGT CGCTGCCGAA CCACTCCGGC GCCACGGTCG CCTTCGACGT CTACGCGGTC GGCATCACCA GCGCCAACGC CCAGAACCAG ACCCTGACGG TGAAGGTCGG GGATCTGGCG ACCACCGCGG TACTCGCCTT CACCCCGACG GCGGCGAGCG TGTCCTCGGG CTCTGCGACG GTCACCGCGC TGGGCCAGGG CCAGACCGCC CAGGGCGCGC CCTACCAGCG CTACCGGGTC ACCATGACGA CCCCGCAGTA CAGCGACGAG ATGCGGGTGC AGCTGACGCC GTCGGGCTTC CGGGGCATCA TCGGCACCAG CCTGGCCGTC GACAACATCT CCGTAGGCGT CACGCTCGTC CCGGCGCAGA CGTTCGCCGC GGCACTGCCG CTGGCGGTCT CGGCCGGCAC CGTCGACGGC GCTGCCGCGG CGGGCGCCGG AACGCTGGAG AACACCGGAT CGGCGGATGT GTACTCCTTC ACCGTCCCGG CCGGCGGCCA GCACCTGAAC CTCAGCATCG GCTCGTGCCC CGCTGAGGGG CAGTCCGACG GCATCTCCTG GACCCTGGCG ACCGCGGCGG GCCACACGAC CGCCTCAGGG GTCTGTCTCG ACCGGGACCT CGGGCTCGTG GCGGCCGGAC AGTACACGCT GACCGTCCGC GGCCCGGGAC TCGTCGGACC GTACACGGTC AACATGGAGG CTCCGCAGTC ATTCACGGCG ACCTTGCCTC TGGCGGTGAC GGCGAACACC CTGAACGGGA CCGCCACCAC CGGGGCGGGG GTGTTCGAGG ACGGCGCGTC GCAGGACATG TACTCCTTCA CCGTCCCGAC CGGCGGCAAG CAGCTCGCGG TGAGCCTGCG CTCGTGCCCG GCGTCGGACA ACTACTCCCC GGGCACCTGG AAGCTGCTCG ACGCCGCGAC GCAGACCGTG GTCCACTCCG GCTACGGCGG GTGCTCCTAC GCGGACTTCG GGACGCTGCC GGCCGGCTCC TACACGCTGC TCGTCGCGGC CAACGGCACG CCGGGCCCGT ACACCCTGGA CCTGTTGTCG CCGCAGTCCT TCACGGCGAC CCTGCCGCTG ACGACGACGG CGAACACCAT CAACGGGACC GCGACACCGG GCGCGAGCGA CTTCGAGACC GGCGCTTCGC AGGACACGTA CACCTTCACA GTGCCGACCG ACGGCCAGTT CCTGGACCTG GACATCACGG CGTGCCCGAC GGCCGGCTAC TCCACGCCGC TGCGCTGGAA GCTGATCAAC ACCGCGACCG GAGCCAGCGC CGCCAACGGA AACTGCTCGT ATTCCAGCCT CGGTCCACTG GCGGCGGGCG GCTACCAACT TCTGGTCAGC GCCGGGGGCG TGGCCGGCGG CTACGCGCTG AACCTCGAGG CGCCGCAGTC GTTCGCGGCG ACCTTCCCGC TGGCGGTGTC GCCGAACGTG GTCAACGGCG CGGCGGCGAC CGGCGCGGGC CGGTTCGAGA CCGTCGCGTC GCAGGACATG TACACGCTCA CCGCGCCGTC CGACGGCTCG CCGGTGCTGC TGGACATCGC CTCGTGCCCG ACGCTGGACT ACAGCACCAC GCTGACCTGG CGGCTGCTCG ACAGCTCCGG CACGGCGATC GCCCACGGCA AGTGCGGCGT GGCCGGCCTC GGGGTCCTGG CCGCCGGCAG CTACCGGCTG GCGGTGGACT CCGGCGGGAT GATCGGGACC TACAGCCTGT TCGCCAGCGC GGGCGGCGCG GGGACCCCGG CGGCGACCTT GGACGGCACG CCGGACGTGG TCACCACCAC CGTCGCGGCG CAGTCGGTCG CGATCGGCTT CGTCAACCCG ACGAGCCAGA CCGTCGCGGT GACCGGCTCC TCCACCCTCA CCAGCGGTGA CTGCGACTAC TCCTCGCTGG TGTTCTACCT GTACGACCAC ACCGGCGCCG AGGTCACCCA CACCGGCCTG CAATGCGGCA GCGCCGGGAT GCTCTACTCG CCGGTGCTGC CGGCCGGGTC CTACACCCTG CTCATCGTTC CGCCGGCGCC GGTCACCGGG AAGCTGGGCG TGCAGATCTT CGGCGCGTCC TCGACCGCGG TGACCGCGAC GCTGGACGGC GCGCCGGCGT CGGTGACGAC GACAGCCGCC TCCCAGTCGG CGGTGGTCAG CTTCACCAAC CCGGCGCTCC AGGCGGTGAC GATCACCGGC TCCGCGGCGA TCACCAGCGG GGACTGCGAC TACAGCTCGG TGAAGTTCTA CCTCTACGAC CACTCCGGTA CGCAGGTGAA GAGCGGGAGC CTGAGCTGCG GCAGCGCGGG CGTGCTGTAC TCCCCGACGC TGGGCGCGGG GTCCTACATG ATGTTGATCG CCCCGCCCGG GCCGGTCACG GGTACCTACG GCGTGCAGGT CTTCGGCGCC GGGTCGTCGA TGGCGACCGC GGCACTGGAC GGCACGCCCG CCTCGGTGAC CACGACCACC GCCTCGAAGG CGGTCGCGGT CGGCTTCACC CTTCCCGCCG ACCAGGCGGT GACCGTCACC GGCTCGACAA CGATCACCGG GGACTGCGAC TACAGCTCGG TGAAGTTCTA CATCTACGAC CACTCCGGCA ACCAGGTGAG CAACGCGAGC CTGAGCTGCG GCTCATCCGG CGTGCTGTTC GGCAAGGCCC TGACCGCCGG GTCGTACACG GTGCTGGCGG TCCCGCCGGG ACCGGTGACG GGCAGCTACG GCGTCCAGGT CTTCGGCGGG AGCGTTCCGG CGGTCGCCAC GCTGGACGGG ACGCCCAAGG CGGTGACCAC CACGGTCGCC TCGCAGGCGG CGGCGGTGCG CTTCACGGTG CCGAGCAGCC AGACGGTGAC GATCGCCGGA TACGCGACGA TCACCACCGG CGACTGCGAC TACGCCTCGG TGAAGTTCTA CCTCTATGAC AGCACCGGTA CCCAGCTGAA GAACCAGAGC GTCAGCTGCG GCTCGTCCGC CGCGCTGTAC GGCGCGACGC TGGCGGCGGG CTCCTACACG CTGGTCGCCG TCCCGCCGGG CCCGGTGGCC GGTACCTACG GCGTCCAGGT CTTCGGGGCG ACACCGCCGG CGACCGCCGC CCTGGACGGC ACGCCGACCT CGGTGGCCAC CACGACCGCG TCCCGGTCGG TGGGCATCGG ATTCACTGTG GCGACCGCCC AGACCGTCAC GATCACCGGC TCCTCGGCGA TCACCGGCGA CTGCAACTAC GCGTCAGTGC ACTACTACCT GTACGACCAC ACCGGCGCGC AGCTGAACAA CATGAGCCTC AGCTGCGGGA GCGCGGACGT TCTGTTCGCC CCGACCCTGG CAGCCGGTTC CTACCTGCTG CTGATCGTGC CGCCGGGCCC GGTGACCGGC ACCTACGGCG CCCAGGTCTT CGGCGCCACG GCGGTATCGG CCACGGCCGC GCTGACCGGC GCCCCGGTCT CGGCGACCAC CACGATCGCC GGCCGCGCGG CGGCGTTCGG CTTCACCGTC CCGACCAGCC AGGCCGTCAT GATCGCCGGT TCCTCGACGA TCACCGCCAA CTGCAACTAC ACGTCAGTGC ACTACTACCT CTACGACAGC ACCGGCACCA AGGTGAAGAA CGTCAGCCTC AGCTGCGGCA GCGCCGGCCA ACTCTTCACC GCGACCCTGG CCGCCGGCTC CTACAAGGTG GTGATCGTCC CGCCCGGTCC GCAGACCGGC AAGTACACCG CCCAGGTCTT CGGCGCGACC GCCTCATCGG CGACCGCGCC CACCACCGGC ACGCTGACGT CGGTGAAGAC GACCGTCGCG GGCCAAGCCA CCTCGATCGC CTTCACCACC ACGGCGGCCA AGAACGTGAC ACTGGCCGGC GCCACGACGA TCACCGGAGG CACCTGCGGC ACCACATCAA TGGCCTACTA CCTCTTCGAC CACACCGGTA CCCAACTCAA GAACGGCAGC CTCGGCTGCG GCAGCACCGG CACGCTGTAC ACGATGACGG CACTGCCGGC CGGCTCCTAC ACGGTACGGA TCGTGCCGTC GGGCCTGGTG TACGGATCGC TGGGGGTGAA GGTGACCGCG GCCTGA
|
Protein sequence | MVPNLVLPPC SPNSTDTSIP DPGGPRCIAG LSMSTRRTLG YVYNPATGSY TVPCISDPRA LIPDATYSKM DCKPLYLAGE NIAQTRVIAQ LNAAGVYGDS AATGITANLQ WETILPDPAA PDKNNRPDLL LYDRTKANGP VGLVEMKGNW NTKDDPVAEV KKYVQDWPVS NHPAVEYHFT TPISDDFKIQ LEPPCKDDPT KHSYYQFHTY SDPANDGVIR ISRTFVGCPP RQKGSGQDEQ NGGYQGTIGA DADHNGVDDI WDFVRNHPEL WYLTTPMLIP TFHPLSKPVM VSLDRDALTE LEEATEDAAP DAIDAEWAAL VDSMPDAVAE AAADIAAGDA TALGLDATEM ALAELALDGG FVTLSALFVA PLIAAAVLAA IWAVMHWHLF GDPHMTTLDG LAYDLQDQGE FEVVHVPSLN LDVQARFLPL GGSRTFTVVD SVVFTINGSL VELNQHGVAT VNSVPIPSSQ TLTEFGDGAA LVRNGNKFVA TFGAGNARLA FGDSSLGFDI APGIPTTGLL GNNDGIPGND LVMADGTPLT SPTAATIDGT YANSWRLTDN ESDFTYDEDM DTAAYTDLTF PSNVVTLADF AQSDQDLARQ ACNAQAVPPG PQFDACMLDV AQTGDANYAK AAAAVTDVLQ DYSAHTVDAS GTVTENFEGA VGSNFRPDST ESIGGTTAAG PVFDGSGYSF SVPSLPNHSG ATVAFDVYAV GITSANAQNQ TLTVKVGDLA TTAVLAFTPT AASVSSGSAT VTALGQGQTA QGAPYQRYRV TMTTPQYSDE MRVQLTPSGF RGIIGTSLAV DNISVGVTLV PAQTFAAALP LAVSAGTVDG AAAAGAGTLE NTGSADVYSF TVPAGGQHLN LSIGSCPAEG QSDGISWTLA TAAGHTTASG VCLDRDLGLV AAGQYTLTVR GPGLVGPYTV NMEAPQSFTA TLPLAVTANT LNGTATTGAG VFEDGASQDM YSFTVPTGGK QLAVSLRSCP ASDNYSPGTW KLLDAATQTV VHSGYGGCSY ADFGTLPAGS YTLLVAANGT PGPYTLDLLS PQSFTATLPL TTTANTINGT ATPGASDFET GASQDTYTFT VPTDGQFLDL DITACPTAGY STPLRWKLIN TATGASAANG NCSYSSLGPL AAGGYQLLVS AGGVAGGYAL NLEAPQSFAA TFPLAVSPNV VNGAAATGAG RFETVASQDM YTLTAPSDGS PVLLDIASCP TLDYSTTLTW RLLDSSGTAI AHGKCGVAGL GVLAAGSYRL AVDSGGMIGT YSLFASAGGA GTPAATLDGT PDVVTTTVAA QSVAIGFVNP TSQTVAVTGS STLTSGDCDY SSLVFYLYDH TGAEVTHTGL QCGSAGMLYS PVLPAGSYTL LIVPPAPVTG KLGVQIFGAS STAVTATLDG APASVTTTAA SQSAVVSFTN PALQAVTITG SAAITSGDCD YSSVKFYLYD HSGTQVKSGS LSCGSAGVLY SPTLGAGSYM MLIAPPGPVT GTYGVQVFGA GSSMATAALD GTPASVTTTT ASKAVAVGFT LPADQAVTVT GSTTITGDCD YSSVKFYIYD HSGNQVSNAS LSCGSSGVLF GKALTAGSYT VLAVPPGPVT GSYGVQVFGG SVPAVATLDG TPKAVTTTVA SQAAAVRFTV PSSQTVTIAG YATITTGDCD YASVKFYLYD STGTQLKNQS VSCGSSAALY GATLAAGSYT LVAVPPGPVA GTYGVQVFGA TPPATAALDG TPTSVATTTA SRSVGIGFTV ATAQTVTITG SSAITGDCNY ASVHYYLYDH TGAQLNNMSL SCGSADVLFA PTLAAGSYLL LIVPPGPVTG TYGAQVFGAT AVSATAALTG APVSATTTIA GRAAAFGFTV PTSQAVMIAG SSTITANCNY TSVHYYLYDS TGTKVKNVSL SCGSAGQLFT ATLAAGSYKV VIVPPGPQTG KYTAQVFGAT ASSATAPTTG TLTSVKTTVA GQATSIAFTT TAAKNVTLAG ATTITGGTCG TTSMAYYLFD HTGTQLKNGS LGCGSTGTLY TMTALPAGSY TVRIVPSGLV YGSLGVKVTA A
|
| |