Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1053 |
Symbol | |
ID | 8602363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1202047 |
End bp | 1204014 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003298677 |
Protein GI | 269125307 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.463528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGAT ACCGATACGG CGCCTACCAA GGCGGCCCGG ACCCACTGGA GCCGCCCTAT GACATCCGCT CGGCGCTGGA CGCCATGGGC GATTCGGTGC TGGAGGGCTC CAGCCCCGGC GAGGCGCTGC GCGCGCTGCT GCGCCGCGGA CTGCCGGGCG GCCGGGACCG GCGGGCCCGG CGCGGCCTGG ATGAGCTGCT GCGGCAGGTG CGGCAGCGGC GGCGGGAGCT GCGCGAGCGG GGACGGCTGG ACGGCGTCCT GGAGCAGGCG CGGGCGCTGC TGGACACCGC CATCGGGCAG GAACGGGCCG AGCTGTTCCC CGACCCCAGC GACGAGGCGC GCATGCGCGA GGCCGAACTG GACGCGCTGC CGTCCGACAC CGCCCAGGCG ATCCGGCGGC TGAGCGACTA CGACTGGCGA TCGCCGGCCG CCCGCGCCAC CTTCGAGCAG CTGAAGGACC TGCTGCGGCG CGACGTGCTG GACGCCCAGT TCCAGGGCAT GCGGCAGGCG CTGGCCAACC CCGACCCGCA GGCCATGCAG CGGGTCCGGG ACATGATGGC CGCGCTGAAC GACATGCTCG ACGCCGACGC CCGCGGCGAG CACACCCAGG AGGACTTTGC GAACTTCATG CGCGAGTACG GGGACTTCTT CCCCGACAAC CCGCGCAACC TGGAGGAGCT GGTCGACTCG CTGGCCCGGC GGGCGGCGGC GATGGACCGG CTGCTGGCCT CGCTGAGCCC CGAGCAGCGC CAGGAGCTGG CCGCGCTGAT GGCGCAGGTG ATGGAGGACG CCGGGCTGGC GATGGAGATG ACCCGGCTGG GCGAGGCGCT GCGGGCCCGC CGCCCCGACC TGGGCTGGGG GACCCCGGAG CGGATGAGCG GCTCGGACCC GCTGAGCGTC AGCGACGCCA CCGCCGCGCT GGCGGAGCTG GCGGACCTGG CCGAGCTGGA GGCTGCGCTG GCGCAGGACT ATCCGGGCGC CAGCCTGGAC GACATCGACG AGGAGGCGGT GCGCCGGGCG CTGGGCCGCC AGGCGGTCGA CGACCTGGCC GAGCTGCGCC GCATCGAAAA AGAGCTGGAA CGCCAGGGCT ACCTGCAGCG CAGTGGCGGC CGGCTGGAGC TGACCCCCAA GGCGGTGCGC CGGCTGGGCG AGACCGCGCT GCGCCGGGTG TTCTCCCACC TGGAGGGGGG CCGGCGCGGC GACCACGACC AGCGCGACGC CGGGCAGGCC GGGGAGCTGA CCGGCTCCTC ACGTCCCTGG CGGTTCGGCG ACGAGCAGCC GCTGGATGTG GTCCGCACGG TCGGCAACGC CATCCGGCGC AACGCCCAGA ACCCCACCGG CGACCGGTCG GTCAAGCTCA GCGTGGACGA TTTCGAGGTG CTGGAGACCG AGCGGCGCAC CGCGGCGGCG GTGTGCCTGC TGGTGGACCT GTCGTACTCG ATGGTGCTGC GCGGCGCGTG GGGGGCGGCC AAGCAGACGG CGCTGGCGCT GCACTCGCTG GTCACCGGCA AGTACCCGCA GGACGCCATC CAGATCATCG GGTTCTCCAA TTACGCCCGG GTGCTGCGTC CCACCGAGAT GGCCGCGCTG GACTGGGACA TGGTGCAGGG CACCAATCTG CACCACGCGC TGATGCTGGC CGGACGGCAC CTGGACCGGC ACCCGGACTT CGAGCCGATC GTGCTGGTGG TCACCGACGG CGAGCCCACC GCCCACCTGC AGCCCAACGG CCGTTCGCTG TTCGACTACC CGCCCTCCCG CCAGACGCTG ACGCTGACGC TGGCCGAGAT CGACAAGATG ACCCGGCGCG GCGCCACCTT GAACGTGTTC ATGCTGGCCG ACGACCCCCG GCTGGTGTCG TTCGTGGAGG AGGTCGCCCG GCGCAACGGA GGCCGGGTGT TCGCCCCCGA GGCCGGCCGG CTCGGCGAGT ACGTGGTCAG CGACTACCTG CGGATGCGCC GGGGATGA
|
Protein sequence | MSRYRYGAYQ GGPDPLEPPY DIRSALDAMG DSVLEGSSPG EALRALLRRG LPGGRDRRAR RGLDELLRQV RQRRRELRER GRLDGVLEQA RALLDTAIGQ ERAELFPDPS DEARMREAEL DALPSDTAQA IRRLSDYDWR SPAARATFEQ LKDLLRRDVL DAQFQGMRQA LANPDPQAMQ RVRDMMAALN DMLDADARGE HTQEDFANFM REYGDFFPDN PRNLEELVDS LARRAAAMDR LLASLSPEQR QELAALMAQV MEDAGLAMEM TRLGEALRAR RPDLGWGTPE RMSGSDPLSV SDATAALAEL ADLAELEAAL AQDYPGASLD DIDEEAVRRA LGRQAVDDLA ELRRIEKELE RQGYLQRSGG RLELTPKAVR RLGETALRRV FSHLEGGRRG DHDQRDAGQA GELTGSSRPW RFGDEQPLDV VRTVGNAIRR NAQNPTGDRS VKLSVDDFEV LETERRTAAA VCLLVDLSYS MVLRGAWGAA KQTALALHSL VTGKYPQDAI QIIGFSNYAR VLRPTEMAAL DWDMVQGTNL HHALMLAGRH LDRHPDFEPI VLVVTDGEPT AHLQPNGRSL FDYPPSRQTL TLTLAEIDKM TRRGATLNVF MLADDPRLVS FVEEVARRNG GRVFAPEAGR LGEYVVSDYL RMRRG
|
| |