Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2875 |
Symbol | |
ID | 8829287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013930 |
Strand | + |
Start bp | 227856 |
End bp | 229703 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003494826 |
Protein GI | 290243156 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0100265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCAGC AATTCATCAA GCGCTCGTTT CACCTCACGG CACAGGCACT TGCCAACCGC ATGGGGGTTC GAGTCACTTT CGGGGGTCAG ATGGCGTATG CCACACCTGA CCACACCATC AACCTGCCGA ATCTCCCAGC GGATTCCGTC ATTGCGCGGG CCCTGGGGCT TTGGTACGTC ATTCACGAGG TTTCTCACCT CAATGAAACG GACTTCGATT GGTGCAAGAA GTGTTCTCCC GTGCGTTTTC GCATGTTGAA CATTGTCGAG GATCATCGAA TCGAGTCCGC AGCCATGAAA CGCTGGCCCG GGGCACGCAG CATCATTGTC GAGGGTCGTG AGAAGTGGCT AGACGAGGGA AAGTTGGGAA TGCCGGTTCT GGGCTCCGGT CCGGCGCTGC ATCTTCTCAA TGGCACCCTC GCCCTGCTGA ACGCCCGCCT TTTCGGGGCA GACTCGGACG CATTGAGAAA TCTTGCGCCC GGGACAGTGT TCGCGCTGAA GATCCAGTTC CCCGAACTGG ACCTGAAGGC GTACCTGAAT CTTCTGTCCG AGGCGGACGA TCTTCAAACC ACCGAGGAAG CGAGCAACCT GGTCAACCGC ATCATTGAAT TGGTGACGGG GCAAACCGAG GATCAATCGC AGGAAGGAGG TCAGCAGGGA GGAAACGACT CCGGTGACGA AGCCGGTGGC GACGAAGCTG GTGGCGACGA AGCTGGCGGT GACGAAGCCG GTGGCGACGA AGCTGGTGGC GACGAAGCTG GTGGCGACGA AGCTGGCGGT GACGAAGCCG GTGGCGACGA AGCCGGTGGC GACGAAGCTG GTGGCGACGA AGCTGGTGGC GACGAAGCTG GCGGTGACGA AGCCGGTGGC GACGAAGCTG GTGGCGACGA AGCTGGTGGC GACGAAGCTG GTGGCGACGA AGCTGGTGGC GACGAAGCCT GGTCAACGCG CCCGACGGAG CAGATGCTTG ACCAGGAAGC GGACGGAATG GAAGAAACCT TCGATATGGG CGATGCCCTG AAGGAATCTA TCCAGGAAGT CAGCCAAAAG GAAGAGCAGG AAAAAGGTCG GTACACCGAC GGTCTGTGCC CTTTCACCTT CTGCGGCGAA ATGCCTGTGC AGGGATCGGG TGATCGGTTA GTCCAGCAAG CAGGGATCGC AAGTGCGGCT CTGCGGACAC GATTGGCGAG TCAGCTCCAA TCGGTCAACC GAGAGCGCCG TTGGGCCAGC CGGAAAGGCT CCAAGCTGAG TAGCCGGCAC CTGTCGCGTG TGGTGACCGG GGATCATCGG GTTTTTGGCA AACGCCAGGA ATCGGGGACC CCGAATACCG CCGTTCAGAT ACTCGTGGAC CGTTCCGGTT CCATGGCCGG TGATCCGATT GAGACTGCGA TGACGGCCGC GCTGGCGATT CAGCTGGCAA CCGACAGTCT TCGTGGGATT AACACTCAGG TCTCGGCTTT CCCGGCGTCT TCGTCAGGAG GGCTGGTGCC GATCACGTCG TTCGGGGAGA ACGGGCGCAT GAAGGCAGAC AACTTCGGAG TCGGCTCAAC AGGCGCAACG CCAATGAGCA ATGCGATTCT CGGGGTTCTG CCCAGCATGT TTGCTCGCTC CGAAAGCCGC AAGGTGATGC TGGTCATCAC CGACGGGGCC CCAAACGACT CGGAATCGGC TATGGAGGCG ATCCGAATGG CCCGTGATGT GAACGTCGAA ATGTATGCAA TCGGGATCGA AACGGATCCT TCGCATCTTT TCGGCGTTGA AAACACCACG GTCATCCAGT CGGTCGGTGA GTTGGCGGAG AATATCTTCG GACTACTGAC ACCGGTACTG ACCCGTACCG TCGCTTAA
|
Protein sequence | MFQQFIKRSF HLTAQALANR MGVRVTFGGQ MAYATPDHTI NLPNLPADSV IARALGLWYV IHEVSHLNET DFDWCKKCSP VRFRMLNIVE DHRIESAAMK RWPGARSIIV EGREKWLDEG KLGMPVLGSG PALHLLNGTL ALLNARLFGA DSDALRNLAP GTVFALKIQF PELDLKAYLN LLSEADDLQT TEEASNLVNR IIELVTGQTE DQSQEGGQQG GNDSGDEAGG DEAGGDEAGG DEAGGDEAGG DEAGGDEAGG DEAGGDEAGG DEAGGDEAGG DEAGGDEAGG DEAGGDEAGG DEAGGDEAGG DEAWSTRPTE QMLDQEADGM EETFDMGDAL KESIQEVSQK EEQEKGRYTD GLCPFTFCGE MPVQGSGDRL VQQAGIASAA LRTRLASQLQ SVNRERRWAS RKGSKLSSRH LSRVVTGDHR VFGKRQESGT PNTAVQILVD RSGSMAGDPI ETAMTAALAI QLATDSLRGI NTQVSAFPAS SSGGLVPITS FGENGRMKAD NFGVGSTGAT PMSNAILGVL PSMFARSESR KVMLVITDGA PNDSESAMEA IRMARDVNVE MYAIGIETDP SHLFGVENTT VIQSVGELAE NIFGLLTPVL TRTVA
|
| |