Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2023 |
Symbol | |
ID | 8603350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2385919 |
End bp | 2387679 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003299628 |
Protein GI | 269126258 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000927527 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCGGAA CCGGCTCTGG TGACATCTTC GGGCGCGCGG CCGCGCCGCC CCGGCGCCGC ATCCCACGGC CGGGCCTGCG CACCGGTGTG GCGCTCCTCG CCGCCGCGCT GGCGGTGGCC ATCGGCTGCC GCACGGTGCT GAACGCGGCC GGCTGCCGCG GCGAGGGCGC GGGCGGGCCG CTGACCGTCG CCGCTGCCCC CGACATCGCT CCCGCCCTCA CCAGGGCGAT CGACCGCTTC AACGAGGCCC AGGACACCTG CGTGCACGCC CTCGTGCGCC CCGTCGACCC GGCCGCGATA GCGGTGCTGC TGTCCGGCCA GGGCGCCTCC GGCTCCCTGC AGCGGCCGGA TGTGTGGATC CCCGACTCCT CGCTGTGGAC GTCCCTGGTG AACACCTCAC CGGAGCGGAT CGGCCCGCTG CAGGTCAGCC ACGGCCCGCT GGCCTACAGC CCGGTCGTGC TGGGCCTGCC GCAGGGGCTG GTGGACGAGC TGAGAAACCG CGGCGTCACC GCCGACCCCT CCTGGAACCT GCTGCTCGGC GCCGTCCCCG GCGTCCCGGG CGGCGCCTCG GCGGTGCCGC CCGGCCTGGT GCGCCCGCAG GTGCCCGACC CCACCCGCAG CGCCACCGGG ATGAGCGCCC TGGTCGTGGC CGACCGGCTG CTGACCGGCC GGTCCGGGCG GCAGGAGATC TTCACCGCGC TGGTGCGCGC CGTGCGCGAG AACACCGTTC CCTCGGTGGA GGCGGAGTTC CGGTCGCTGG ACGGCCGGGA GCGCAAGCGC CACCCCGTGC TGCTGGTCCC CGAACAGGCG ATCTTCTCCC ACAACGGCAA CCGCCCGGCC GAACGGATCA TCGCCCTCTA CCCGCGCGAG GGCACGCTGT CGCTGGACTA CCCGTTCGCC ATGACCACCG GCGAGGCCGC CCGGCTGGAG GCGGCCCGGG CGCTGGAGCG GGCACTGCGC AGCCCGGTCG CCGCCGCCGA GCTGCGCCGG GCCGGGTTCC GCGGCCCCGA TGGCCGCAAC GTCCCGCACT TCGGCCCGTT CACCGGGGTC AGGCTGGACC CGCCCCGCCG GCTGCCCGCT CCGCCGCCGC AGGCCGTGCG GGATCTGATG CAGACCTGGT CCAAGCTGAC CTTGAGCACG CGGATGCTGG TGCTGTTCGA CGTCTCCGGC TCCATGCGGC GGCGGGTCGC CCCCGGCCTG AGCCGGCTGC AGGCCACCGC CCGCGTCGCC CAAAGCGGGC TGCCGCTGCT GCCCGACGAC AGTGAGCTGG GCATCTGGCT GTTCTCCACC GACCTGGAGG GCGGGCGGGA CTGGCGCGAG GTGGTCCCGG TGGGGCCGCT GGGCGAGCGG GTCGGCTCGG TCACCCGCCG CCAGCTGATC TTGTCGGAGC TGGGCCGCAT CCGGGCCGAG CGCAAGGGGC GCACCGGGCT GCACGAGTCG GTGCTGGCGG CCGTCCGCCG GATGCGGGAG GGCTACAAAC CCGAGATGGT CAACACCGTG CTGGTCTTCA CCGACGGCCG CAACCAGGAC GCCGACGGCC CCACCCTGGC GCAGACCGTG GCGGCGTTGC GCCGCGAGCA CGACCCCAAC CGCCCGGTCC AGCTCATCAT CCAGGGCTAC GGCCCCGACG TGTCGGTCCC CGAGCTGCGC GCCCTCACCG AGGCCACCGG CGGCCTGGTG CAGATCGCCC GCACTCCCGA GGACGCCGGC AGGCTCTTGC TGCAGGCCAT GTCCCGCCGC ATCTGCTCAC CCGAGTGCTG A
|
Protein sequence | MGGTGSGDIF GRAAAPPRRR IPRPGLRTGV ALLAAALAVA IGCRTVLNAA GCRGEGAGGP LTVAAAPDIA PALTRAIDRF NEAQDTCVHA LVRPVDPAAI AVLLSGQGAS GSLQRPDVWI PDSSLWTSLV NTSPERIGPL QVSHGPLAYS PVVLGLPQGL VDELRNRGVT ADPSWNLLLG AVPGVPGGAS AVPPGLVRPQ VPDPTRSATG MSALVVADRL LTGRSGRQEI FTALVRAVRE NTVPSVEAEF RSLDGRERKR HPVLLVPEQA IFSHNGNRPA ERIIALYPRE GTLSLDYPFA MTTGEAARLE AARALERALR SPVAAAELRR AGFRGPDGRN VPHFGPFTGV RLDPPRRLPA PPPQAVRDLM QTWSKLTLST RMLVLFDVSG SMRRRVAPGL SRLQATARVA QSGLPLLPDD SELGIWLFST DLEGGRDWRE VVPVGPLGER VGSVTRRQLI LSELGRIRAE RKGRTGLHES VLAAVRRMRE GYKPEMVNTV LVFTDGRNQD ADGPTLAQTV AALRREHDPN RPVQLIIQGY GPDVSVPELR ALTEATGGLV QIARTPEDAG RLLLQAMSRR ICSPEC
|
| |