Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0872 |
Symbol | |
ID | 8806627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 928288 |
End bp | 930609 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003460123 |
Protein GI | 289208057 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.743934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTC ATCTGGAGGA CTACCAGGAC CTCCTCGAAG ATCTGGGGAA CCACGCCGCC GAGGTGCTGG AGAGTTCCTG GCAGGAGGCC AGCCGTGCCT TCAGCGCCGG TGGTCTCAAG CGCTACTACC TTGAAGGGGC CCGGAGTCTT CAAGCCCTGG GACGCGGTTC GGAGATGGTC GTCGCCTTCG TGCAGGAGGC GCCGGCACTG GCCCAGGAAC TGGGCGAGGA TGCCGTTGCG GAGCTGCTCG GTTCCGCGAT CAAGATGTAC TCCAAGACCT CCGCGGCGGT CATTACCCTG GTGATCGAGA CCAGCCCGGT TGCGGCCCAG CGGCTTGGCG ACTATTCGCT GTTCGAAGGG TATCTGCGCC TGCTGGACAA CCTGCTGGGG CAGGCGCCGC GTGGCGTGCG CCCGATGCTG GAGCACCTCG ACGAGCTGCT AACCCACCTG ACGCTGGGTG GGCTTCGGCG CTGGGCGACG TGGGGCGCAC AGGCGCACCG CACGGACCTG GAGGCGCAGA TTGCCTACTT CGGCCTGGAG TCGCCGGAGT CGCGCTCGGT GTTCCAGAAG GAGCGCCGCG GCACCCTGTT CGTGGACGTG CAGCGACGCA TCAACCTCTA TCTGCGCTCG CTGTGGGCGC GCGATTTTTT CATGCGCCCG ACCAGTGGCG ACTTCGAGTC GCGCGAGGGA TATCGGCCCT ACATCGAGGG CTTCACGATC TTCCTGCCGG ACGCCTACGA CGATGTCGAG GGTCTGACCG GTCTGGATCT CTATCGCGCG GCGGCCGCGC ATGCGGCATC CCATGTGGTG CATACCCGCG AGATGCAGTC GGAGGAGGGG CCACGGGGCG TGGTTGGCAT CCTGACCGAG CTGTTCGAGG ATGCGCGCAT CGAGGCCCTG GCGATCGCCG CGTTCCCGGG CCTGCGCCAG ATCTGGGTGC CGCTGCATAC CGCCACGGCG GAGGATGGAG AAACCCCGCA GGCATTGCTG GCGCGCACCG CGCGTGCCCT GCTCGATCCG GATTACTACG ACCCCCATGC GTTTGTGCAG AAGGCCGTGA GTGCGTTTGC GGAACGGGTA GACGACCTCG AGGCCGCTGG ATTGGCCCGC GAGCTGGCCG AAGCGCTGGC GCCCGAGTTC CCCGAGCAGG GCTACATGAA TCCGGCCAAG GTCGCGCCGG ACGTCCTCTA TCGCGACGAC AACCGCTATC TCTGGAACTT CGGCCGCGAA GAGGAAGCGG CGGAAGTCCT GGCCGCGCAT GGCTCCGGGC AGGAGCGATT CTACGTCACC CCGATGCAGA TGATTAACAC CCTGGACGTG CCGAATGCCG GGGACGACGC CCAGCAGGTC ATGGTGCTGG CGACCGAGTG GTTCCGCGAC GGCGAAGAAG AAAGCATTAA CCAGCAGGAG GGCAAGGAGC CGATCTCGCT GCCGTTCCAC TATCACGAGT GGGACTATCA GATGCAGCTG GAACGTCCGC ACTGGGTCAC ACTGCTGGAG CGGCGCCCGA GAAAGGGCGA AATCGAGGAG ATCGAGGCGA TCTACGACAA GCACAAGCCG CTGGTCTCGC GCCTGAAGTT CCTGATCGAG GCGATGCAGC CCCAGGGCGT GCAGCGCATG CGCAAGCAGC CGGAGGGCGA CGAGCTGGAC GTGAACGCGC TGGTCGAGGC CCAGATTGAT GTCCGCATGC GGCGCCAGCC CGACGAACGC ATCTATGTGC GCAATCTGCG CCATGTACGC GACCTGTCCG TACTGGTGCT GCTGGATCTG TCGGAGTCGA CCAACGAGAC GATCGGCGAC GGCGAGACGA CCGTGGTGCA ACTGGCGCGC GAGGCGGCCA CGCTGCTGTC CGGCGCGATC TCACGCATTG GCGACCCGTT CGCCATCCAC GGCTTCGCCT CCGATGGGCG CCACGACGTC GAGTATTACC GCTTCAAGGA TTTCGATGCC CCGTTCGACG ACACCGCGAA GGCGCGTCTC GCCGGGATGA AAGGCCAGCT ATCCACCCGC ATGGGAGCGG CTATCCGCCA TGCCGGCCAG TTGCTGCACA ATCAGGGCAC GGCGAAAAAG CTGCTGCTGA TCATCACCGA CGGCGAGCCG GCGGATACCG ATGTGCGCGA CCCGCAGTAC CTGCGTCAGG ATGCGAAGCG GGCGGTAGAG GAGGTCGGGC GCAAAGGCGT GTACACCTTC TGCATGAGTC TCGATCCGCA GGCGGATGAA TACGTCGAGC GCATCTTCGG GGCCCAGCAT TTCATGGTGC TGGACCAGAT CGAGCGGCTG CCGGAAAAGC TGCCGATGCT GTACGCCGGT CTTACGCGGT AA
|
Protein sequence | MSVHLEDYQD LLEDLGNHAA EVLESSWQEA SRAFSAGGLK RYYLEGARSL QALGRGSEMV VAFVQEAPAL AQELGEDAVA ELLGSAIKMY SKTSAAVITL VIETSPVAAQ RLGDYSLFEG YLRLLDNLLG QAPRGVRPML EHLDELLTHL TLGGLRRWAT WGAQAHRTDL EAQIAYFGLE SPESRSVFQK ERRGTLFVDV QRRINLYLRS LWARDFFMRP TSGDFESREG YRPYIEGFTI FLPDAYDDVE GLTGLDLYRA AAAHAASHVV HTREMQSEEG PRGVVGILTE LFEDARIEAL AIAAFPGLRQ IWVPLHTATA EDGETPQALL ARTARALLDP DYYDPHAFVQ KAVSAFAERV DDLEAAGLAR ELAEALAPEF PEQGYMNPAK VAPDVLYRDD NRYLWNFGRE EEAAEVLAAH GSGQERFYVT PMQMINTLDV PNAGDDAQQV MVLATEWFRD GEEESINQQE GKEPISLPFH YHEWDYQMQL ERPHWVTLLE RRPRKGEIEE IEAIYDKHKP LVSRLKFLIE AMQPQGVQRM RKQPEGDELD VNALVEAQID VRMRRQPDER IYVRNLRHVR DLSVLVLLDL SESTNETIGD GETTVVQLAR EAATLLSGAI SRIGDPFAIH GFASDGRHDV EYYRFKDFDA PFDDTAKARL AGMKGQLSTR MGAAIRHAGQ LLHNQGTAKK LLLIITDGEP ADTDVRDPQY LRQDAKRAVE EVGRKGVYTF CMSLDPQADE YVERIFGAQH FMVLDQIERL PEKLPMLYAG LTR
|
| |