Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3298 |
Symbol | |
ID | 9157472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3390046 |
End bp | 3392037 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003648222 |
Protein GI | 296140979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACA GACCGTCTAT GGCTAGAGGC TCGCGTCGGC GCCATCTGGC GCAGTACCGC CGCTATACCG GCGGGCCGGA TCCCCTGGCG CCGCCGGTCG ATGTGCGCGA TGCTCTCGAT GAGATCGGCC GCGACGTGAT GAACGGCGTC TCGCCGCGCC GCGCCCTGCA GGAGTACCTG CGGCGAGGCA GCGCGAACCG CCGCGGCCTC GACAAACTCG CCGAGGCCGC CAACCGCCGG CGCCAGGAAC TGTTGCACCG CAACAATCTG GGCAAGACCT TCGAGGACAT CCGCGCACTG CTCGACGAGG CGGTGCTCGC CGAGCGCAAG GAGCTGGCGC GCGCCCTCGA CGACGACGCC CGGTTCCAGG AGATGCAGAT CAACGATCTG CCCCCGTCCG CGGCGCAGGC CGTGCGCGAA CTCGCCGACT ACCCGTGGCG CTCGCCCGAA GCGGCGCAGA AGTACGAGCA GATCCGGGAC CTGCTCGGCC GGGAGCTGCT CGACCAGCAG TTCGCCGGAA TCAAGCAGGC ACTCGAGGGC GCGACCGACG AGGACCGCGC GGCGATCCAG AAGATGCTCG ACGACCTGAA CACTCTGCTG GACAAGCACA ACGCGGGCAC GGCGACGCAG AACGACTTCG AGGAGTTGAT GGCCGAGCAC GGTCAGTACT TCCCGGAGAA CCCGCGGAAC ATCGACGAGC TCATCGATGC CCTCGCGCGG CGAAGCGCCG CGGCACAGCG GCTCTACAAC AGTCTCAGCC CGGAGCAGCG CGCCGAACTG GAGGAGCTGT CGCAGGCCGC CTTCGGTTCG CCGCAGCTCT CCCAGGCGCT CGGCGATCTC ACCAGCAAGT TGCAGCAGGC CCGGCCCGGG GAGGACTGGA ACGGTAGCCA GGAGTTCGAC GGTGAGCAGG GCATGGGTCT CGGCGACGCC ACGCAGGCCA TGCAGGACAT CGCCGATCTG GAATCCCTTG CCGAACAACT GAGTCAGCAG CACAACGGAG CCGCACTCGA CGACATCGAC CTCGACGCCT TGCAACGCCA GCTCGGCGAC GAGGCCGTCG CCGACGCCCG CACCCTCGCC GAACTGGAGA AGGCGCTCAA GGACCAGGGC TACTTCTCCC GTGACCCCGA CGGCGCGCTG CGCCTGTCGC CCAAGGCGAT CCGGCAACTC GGCCAGTCCA TCCTGCGGGA CGTCGCGCAC CGGCTGACCT CGCGCGGCGG CGAGCGCAAC ACCGCCCGCA GCGGCGCCAC CGGCGAGGCC ACCGGCGCCA GCCGGGAGTG GCGCTTCGGC GATACCGAAC CCTGGGACGC CACCCGTACC GTGCTCAACG GCGTACTGCG CCAGGCGGGA GAAGGGGTCT CCGGCCCGGT GGCACTGGAC GCCCGGGACA TCGAGGTCAC CGAGACCGAG GCGCGCACAC ACGCCGCCGT CGCACTACTG GTGGACACCA GCTTCTCCAT GGTGATGGAG GGCCGGTGGC TGCCGATGAA GCGCACCGCG CTCGCCCTGC ACCAGCTCAT CTCCACCCGC TTCCGCGGCG ACAAGCTGGC CCTGATCAGC TTCGGGCGGC ATGCCCGCAC GCTCACCGTC GATGAGCTCA CCGGCCTCGA CGGCGTGTAC GAACAGGGCA CCAACCTGCA CCACGCACTC ATGCTGGCGC AGCAGCACTT CCGGGCGAAC CCGAACGCCC AGCCGGTGTT GCTGGTGGTG ACCGATGGTG AGCCCACGGC GCACCTGGAG CGGGACGGCG AGGCCTACTT CGACTACCCG CCGCACCCGC GGACTCTGGC GATGACGGTG CGCGGTCTCG ATGCCGCCGC GGCGGCGGGA GCGCAGATCA CCGTCTTCCA GCTGGGCGAT GATCCGGGCC TGACCCGGTT CCTCGACGCC GTGGTGCGGC GCGTGGGCGG ACGGCTCGTC TCCCCCGATC TGGACGGGCT CGGTGCGGCC GTGGTCAGCG ACTACCTGGG TAGCCGCCGC AAGCGCGGCT GA
|
Protein sequence | MTDRPSMARG SRRRHLAQYR RYTGGPDPLA PPVDVRDALD EIGRDVMNGV SPRRALQEYL RRGSANRRGL DKLAEAANRR RQELLHRNNL GKTFEDIRAL LDEAVLAERK ELARALDDDA RFQEMQINDL PPSAAQAVRE LADYPWRSPE AAQKYEQIRD LLGRELLDQQ FAGIKQALEG ATDEDRAAIQ KMLDDLNTLL DKHNAGTATQ NDFEELMAEH GQYFPENPRN IDELIDALAR RSAAAQRLYN SLSPEQRAEL EELSQAAFGS PQLSQALGDL TSKLQQARPG EDWNGSQEFD GEQGMGLGDA TQAMQDIADL ESLAEQLSQQ HNGAALDDID LDALQRQLGD EAVADARTLA ELEKALKDQG YFSRDPDGAL RLSPKAIRQL GQSILRDVAH RLTSRGGERN TARSGATGEA TGASREWRFG DTEPWDATRT VLNGVLRQAG EGVSGPVALD ARDIEVTETE ARTHAAVALL VDTSFSMVME GRWLPMKRTA LALHQLISTR FRGDKLALIS FGRHARTLTV DELTGLDGVY EQGTNLHHAL MLAQQHFRAN PNAQPVLLVV TDGEPTAHLE RDGEAYFDYP PHPRTLAMTV RGLDAAAAAG AQITVFQLGD DPGLTRFLDA VVRRVGGRLV SPDLDGLGAA VVSDYLGSRR KRG
|
| |