Gene Information |
Locus tag | Caul_3424 |
Symbol | |
ID | 5900879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3701396 |
End bp | 3703174 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563930 |
Product | von Willebrand factor type A |
Protein accession | YP_001685049 |
Protein GI | 167647386 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
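The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3701396
END_BP = 3703174
GENE_LENGTH_BP = 1779
PROTEIN_LENGTH_AA = 592


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3703174 - 3701396 + 1 gives 1779 bp, and 1779 / 3 - 1 gives 592 aa, which agrees with the recorded values.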
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.236212 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGCCAC GCCCCCTTCG TATCCTCGGT TCCACAGCCC TCTCCCTGCT GATCGCCGGA TCCGCGAGTG GACCCAGCCT GGCCCTTTCG GAGCCAGGCG ACGGCGTGGT GACACCCGAG GCCTGCGCGG CCCTCGGCTA CGCCGCGCCC GCTCCAGACG GTTACGATAG CGTCGTCGTC ACCGCGACCA AACGGACCTC CCGGGAGCAG AGACTCACGT CGAGAGCCCG GCCCGTGATC GCGACGCCCG GGCTGACGCC ACCACCGCCA CCGCCTCCGC CATCACCACC GCCGCCTCCG GCCGCCTACA GCTTCGCGGC CCCCTCCCCC GTCGTTGCGC CCAACTTCGC GCCGCCGATC CGCGACACCG AGAAATACCC CGGCGCGGCG GCCAACCCGG TCAAGCGCGT GGCCGAGGAA CCGGTCTCGA CCTTCTCGAT CGATGTCGAC ACCGCCGCCT ACGCCAATGT CCGCCGCTTC CTCAACGAGG GCGCCGCCCC GCCTCACGAT GCCTTGCGGG TCGAGGAGCT GATCAACTAT TTCGACTACG GCTACGCCAG GCCCACCGCC CAGGAGCCTC CCTTCAAGCC GACCGTGACC GTGGTTCCCT CGCCCTGGTC GCAGGATCGC CAGCTGATGC ACATCGGCGT GCAGGGCTAT GCGACGCCGC GCGCCGGCCA GCCGCCACTG AACCTGGTGT TCCTGATCGA CACCTCCGGC TCGATGTCCG GCCCCGATCG CCTGCCCCTG GCCAAGAAGG CGCTGAACGT GCTGATCGAC CAGCTTCGGC CGCAGGATCG GGTGTCGATG GTCGCCTATG CCGGTTCGGC CGGGGCGGTG CTGTCGCCCA CCGACGGCAA GTCGAAGCTC AAGATGCGCT GCGCCCTGAC CGCCCTGCGG TCCGGCGGCT CGACCGCCGG CGGCCAGGGG CTGGAACTCG CCTACGCCCT GGCCAGGCAG AACCTCGACC CCAAGGCCGT CAACCGGGTG ATCCTGATGA CCGACGGCGA CTTCAATGTC GGCATCGCCG ATCCGACGCG CCTGAAGGAT TTCGTCGCCG ACCAGCGCAA GAGCGGCGTC TACCTCTCGG TCTACGGCTT CGGGCGCGGC AACTACAACG ACACGATGAT GCAGGCCCTG GCCCAGAACG GTAACGGCAC GGCCGCCTAT GTCGACGGCC TGCAGGAAGC CCGCAAGCTG CTGCGCGACG ACTTCGACAG CGCCCTGTTC CCGATCGCCG ACGACGTTAA GATCCAGGTC GAGTTCAATC CGGCCAAGGT CAGCGAATAT CGGTTGATCG GCTACGAGAC CCGGCTGCTC AATCGCGAGG ACTTCAACAA CGACCAGGTC GACGCCGGCG AGATAGGCTC TGGCGCGGCG GTCACGGCGA TCTACGAGAT CACCCCGGTC GGGGCGAAGC CGTCGTCCGA TCCGCTGCGC TATGGCGCCA AGCCGTCGCC GGCGACGGGC GGGAGCGAGC TGGCCTTCCT GAAGATCCGC TACAAGCCGC CCGGCGGCTC GACCTCGAAG CTGATCGAGC GCCCGATCGG GGCTGGCGAT ATGCACGCCA GTCTGGCCGC GGCGCCGGAG GCCACCCGCT TCGCCGTCGC CGTGGCCGCC TACGGCCAGA AATTGCGCGG CGATCCCTGG GTCGACGCCA GCTTCGACTG GGACGCTGTC ACCGCCCTCG CCCAGGGCGC GCGGGGCGAA GACCCCTACG GCCTGCGCGC CGAGTTCGTG CAGTTGACCC GCGCGGCCAA GGACGTGAAG GGAAGCTAG
Protein sequence | MAPRPLRILG STALSLLIAG SASGPSLALS EPGDGVVTPE ACAALGYAAP APDGYDSVVV TATKRTSREQ RLTSRARPVI ATPGLTPPPP PPPPSPPPPP AAYSFAAPSP VVAPNFAPPI RDTEKYPGAA ANPVKRVAEE PVSTFSIDVD TAAYANVRRF LNEGAAPPHD ALRVEELINY FDYGYARPTA QEPPFKPTVT VVPSPWSQDR QLMHIGVQGY ATPRAGQPPL NLVFLIDTSG SMSGPDRLPL AKKALNVLID QLRPQDRVSM VAYAGSAGAV LSPTDGKSKL KMRCALTALR SGGSTAGGQG LELAYALARQ NLDPKAVNRV ILMTDGDFNV GIADPTRLKD FVADQRKSGV YLSVYGFGRG NYNDTMMQAL AQNGNGTAAY VDGLQEARKL LRDDFDSALF PIADDVKIQV EFNPAKVSEY RLIGYETRLL NREDFNNDQV DAGEIGSGAA VTAIYEITPV GAKPSSDPLR YGAKPSPATG GSELAFLKIR YKPPGGSTSK LIERPIGAGD MHASLAAAPE ATRFAVAVAA YGQKLRGDPW VDASFDWDAV TALAQGARGE DPYGLRAEFV QLTRAAKDVK GS
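The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below is a minimal, dependency-free way to strip that spacing, compute a GC fraction, and translate a nucleotide string. It is not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in permitted start codons), so alternative start codons are not handled here. The helper names are hypothetical.

```python
# Codon table built in the conventional TCAG order. The amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final block of the gene sequence above.
first_blocks = "ATGGCGCCAC GCCCCCTTCG TATCCTCGGT"
last_block = "GGAAGCTAG"

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_block))    # compare with the end of the protein sequence
print(round(gc_fraction(first_blocks), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the recorded GC content.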