Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3360 |
Symbol | |
ID | 9157535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3459576 |
End bp | 3461159 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003648283 |
Protein GI | 296141040 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.354576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGCC ATCACGCCCC CGGTGCCACA CCGGCGGCAA CCCCGCGCGG CCGCACCTAT CTGCGGTACG CGCTGGCCAT CGGTCTGGTC GCCCTGCTGG TCGCGGGACT CGTCGCCATC TTCGGCGGCA AGGTCGTCAG CTGCGGCGAG AAATCCACCT TCACCGTGAT GGCCGATCCC GCCCTCGTCC CGGCCGTGCG CGCCGAAGCG AAGGCGGTCA CCGACAAGTG CACCAGTTTC ACCGTCCGTG AGGTGCCCGA TGCGCAGGTC GCGGGGCATC TTACGGGCGG CGGCGAGGTC TCCGATCTGT GGCTCGCACC GTCGCACACC CGGGTGGCCG CCGTCTCGGC CCAGCTCGGC CGCGACCTGC CCACCACGGA GGTGGCCAGT TCGCCGATCG TGCTGGCCGG CGCCAGCATC CCCGAGCCCA CCAGTTGGCT CGACGCGCTG CAGGCGGCAA CGATCCGCGC CGTCCCCGCC GATTCGCCGT ACGCCACCGC GCCGGTGACC GCCGGGGTGT CCGAGGCGCA GCAGCCGGGC GCGAACCGGC AGGCCCTCAC TGCGGCCCTG GCCCAGTACG CGCAGGCCGC CAAGCGCGTC GACGACGATC CGGTCCGCTC CGCCAAGGCC CAGGGCGGGG TCGTCGTGGT CCCCGAATAC GCCTACCTCG CAGCGAAGAA GGACGAACCG GGCGTCTCCG CCGTCGTCCC GAAGAGCGGC GCCCCGCGCG ACGATCTGCT GCTCACCGTG ACCGCGGGCG GCGACCGCGC CACCGCCGCC AAGACCGGCG CCGACACCCT CGCCGCGGCC TTCGCGTCCG AGAAGGGCAT CGTCGCGCTC GGTGAAGCCG GCTTGCGCGG CAAGGACCTG AGCCCTGCGC CGCCCGACGG CATCGGGAAG GTCGCCGGGC TGCCCGAACC GAACACCGAC GAGCTCACCA AGGCCGAGCA GGCCTACGCC ACTCTCGCAG TGCCGCTCAA GGCGCTCGTC GTGGTCGACA CCTCCGGTTC CATGAACGAA TCCGCCGGCG ACACCACCCG CATCGGCATG CTCGCCTCGG GCTTCACCAA GGTGGTCACC CAGATCCCGG ACGCCAATGC CGTGGGACTG TGGACCTTCT CCATCGGCAG CGCCACCCGC CCCGACTGGA CCGAGGTGGT ACCCACGGCG CGGCTCGACG CGCGCCGCGG TGACAAATCT CAGCGCCAGG CACTGCTCGA CGGGGTGAAC GCACTGCCCC GCAAGGTCGG CGGCGCGACC GGCCTGTACG ACACCACACT GGCCGCGTAT CGCCGCGCGG TGGAGAACTT CGACCCGGCG TACTCGAACT CGCTGATCCT GCTCACCGAC GGCTCGGACG AGAAGCCGGG CGGGATGTCG CTCGATGATC TGGTCGCGCA ACTGCGCACG CTGGTCGATC CGGCGCGGCC GGTGAACATC CACACCGTCG GAATCAGTAA GGACGCCGAC TTGCCGGCCC TCAAGCGGAT CGCGGACGCC ACCGGCGGCA CCGCGCAGGA GGCCGACTCC GAGCAGCAGA TGCTCACCGA CTTCGTGACG GCGATCGCCA AGCGCGCCAA ATAG
|
Protein sequence | MGRHHAPGAT PAATPRGRTY LRYALAIGLV ALLVAGLVAI FGGKVVSCGE KSTFTVMADP ALVPAVRAEA KAVTDKCTSF TVREVPDAQV AGHLTGGGEV SDLWLAPSHT RVAAVSAQLG RDLPTTEVAS SPIVLAGASI PEPTSWLDAL QAATIRAVPA DSPYATAPVT AGVSEAQQPG ANRQALTAAL AQYAQAAKRV DDDPVRSAKA QGGVVVVPEY AYLAAKKDEP GVSAVVPKSG APRDDLLLTV TAGGDRATAA KTGADTLAAA FASEKGIVAL GEAGLRGKDL SPAPPDGIGK VAGLPEPNTD ELTKAEQAYA TLAVPLKALV VVDTSGSMNE SAGDTTRIGM LASGFTKVVT QIPDANAVGL WTFSIGSATR PDWTEVVPTA RLDARRGDKS QRQALLDGVN ALPRKVGGAT GLYDTTLAAY RRAVENFDPA YSNSLILLTD GSDEKPGGMS LDDLVAQLRT LVDPARPVNI HTVGISKDAD LPALKRIADA TGGTAQEADS EQQMLTDFVT AIAKRAK
|
| |