Gene Information |
Locus tag | Tgr7_0052 |
Symbol | |
ID | 7316702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 50615 |
End bp | 52384 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 643614942 |
Product | von Willebrand factor type A |
Protein accession | YP_002512143 |
Protein GI | 220933244 |
COG category | |
COG ID | |
TIGRFAM ID | |
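
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 50615
end_bp = 52384
protein_length_aa = 589


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1770
print(expected_protein_length(length_bp))  # 589
```

Both results match the Gene Length and Protein Length fields, which suggests the record is internally consistent under these assumptions.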
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.649054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTGG GCACACAACT GGCGCTGCTG GAGTGGCGCG CGCCCCTGTG GCTGCTGCTG GCCGGGCTGC CCTGGCTGCT GGCGGCCCTG GGGCGGCTGC GGGAGCGGCC CCTGCGCCGC TATGCCGACC GTGCCCTGCG GCCCTGGGCC CTGTACCAGG CCGGCCGGGA CACGGGGCTG CGCTTCGGGC TCCATGCCCT GGCCTGGCTG CTGCTGGCCG CCGCTCTGGC CGGGCCCCGG GTACCGGCCC CGGTACAGGT GGCCGGGGAG GGCCACCGGG GGGATGTCTC CCTGATGCTG GTGCTGGACG TCTCGGCCAC CATGCAGGCG CAGGACCTCG CCCCCCGGCG CCTGACCCGG GCCCTGCTGG AGGTGGATGG CATGTTGGAA GGGCTGCGGG GGGAGCGGGT GGGCCTGGTG GCCTTCGCCG GGCGCGCCCT GATGCTCGCC CCGCCCACCC ACGACCGCCG CCTGCTGAGC CATTACCTGT CCCGGGGCCC CGAGGCCCTG GCGGACCCCG CCGGTCTGTC CGCCAGCCGG GCCGTGGCCG AAGGCCTGCG ACTGGCCGGC GAGGCCCTGG AAGGCAGCGG CGCGGTGGTG CTGATTACCG ATGGCGACGC CCGGGCCTTT GCCGGGGCGC GGCTGGCCGC CATGCAAACC CAGGCGCGGG CCCTGCGGGA CGCGGGCCAC ACCCTGTACG TGCTGGGCGT GGGGGGCACG GAGCCGGTGC CGGTGCCCGA CGGCGCGGGT GGCCTGCTGC GGGAGGGGGG CGAGACCCTG CTCTCGCGCC TGGATGCCGA GGCGCTCACC CGCCTGGCCG AGGTCGGCGG CGGCCACTAT GCCCAGGCCG GCGCACTGGG CCAATCTTCA CTGGGCCGGG CCTGGGAGCG GCTCTACGCC GACGCGGTGG TCCGTCTGCC CAGCAGCCGT TCCGTGACCC ACTGGCAGGA ACTCCACCCC TGGCTGCTGG CGCCGGGCTT GGTCCTGCTG CTGCTCTCCT GGCTGCGCCC CGCGCCCGCG CCCGCGTCCG CCCTGGTGCT GGCGCTGCTG CTGCCCCTGC AGCCCGCCCC GGTGTCGGCC GACGAGCCTG CCCTGGCGCT GCAGGCACAC AGCGCCTGGA ACGCCGGGGC CTTCGCCCGG GCCCTGAACG CCTATGCACG CCTGCCCGGC CACCCCGGGC GCCTGGGGGA GGGGGCCAGC GCCTACCGGC TGGGGGATTT CGAGCACGCC GCCGAGGCCT TCATCCAGGC CTGGCTGACG GCCCCCGGCG ACCGGGAGCG GGCCGAGGCC CTGTTCAACC TGGGCAATGC CCGCTTCCGC CAGGAGGACT ACGCCGCCGC CGTGGAGACC TACCGGGGGG CCCTGGCCTA CCCCTCGCCC CATCGCCAGG CCATCGAGCA CAACCTGGCC CTGGCGGCAG GCCGAGAGCG GGAGGCGGGC GGCCCCGGCC TGGCGGGGCG CCGCGCCCGC CAGGCCACCG AGGACGTGGG GCGCATGGAC CTGGAGGAGA CCCCGGGGTT TCCCGAGGAC GAGGAGACGC CCCGGCTGCC CACCGAGGAG GGGGACCGGG AGGCCCTGGG TGAGGCGGTG GCCCGGGGCG AGCTGGAGGG CCGGGCCGAG CGGCCGGCGG GCGGCCCCGG GCGGCGCCTG GCCGTGGACC GGGAATACGA CGCAGCCCTG CTCAAGCAGG ATCAGCTGGA GGACCGGGAT CTGCGCCTTT GGCGCGCCCT GCGCGACCGC GAATCGCCCG GTGGCGGGGG CGCGCCATGA
|
Protein sequence | MNLGTQLALL EWRAPLWLLL AGLPWLLAAL GRLRERPLRR YADRALRPWA LYQAGRDTGL RFGLHALAWL LLAAALAGPR VPAPVQVAGE GHRGDVSLML VLDVSATMQA QDLAPRRLTR ALLEVDGMLE GLRGERVGLV AFAGRALMLA PPTHDRRLLS HYLSRGPEAL ADPAGLSASR AVAEGLRLAG EALEGSGAVV LITDGDARAF AGARLAAMQT QARALRDAGH TLYVLGVGGT EPVPVPDGAG GLLREGGETL LSRLDAEALT RLAEVGGGHY AQAGALGQSS LGRAWERLYA DAVVRLPSSR SVTHWQELHP WLLAPGLVLL LLSWLRPAPA PASALVLALL LPLQPAPVSA DEPALALQAH SAWNAGAFAR ALNAYARLPG HPGRLGEGAS AYRLGDFEHA AEAFIQAWLT APGDRERAEA LFNLGNARFR QEDYAAAVET YRGALAYPSP HRQAIEHNLA LAAGREREAG GPGLAGRRAR QATEDVGRMD LEETPGFPED EETPRLPTEE GDREALGEAV ARGELEGRAE RPAGGPGRRL AVDREYDAAL LKQDQLEDRD LRLWRALRDR ESPGGGGAP
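
As a rough illustration of how the two sequence fields relate, the following sketch translates a nucleotide string written in the 10-base blocks used above and computes its GC fraction. It is a minimal example, not the pipeline used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G). Translation table 11
# shares these amino acid assignments; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG) would be reported as its
    ordinary amino acid here; this gene starts with ATG, so that case
    does not arise.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
first_blocks = "ATGAACCTGG GCACACAACT GGCGCTGCTG"
print(translate(first_blocks))  # MNLGTQLALL
```

The output matches the first block of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence would allow the Protein sequence and GC content fields to be checked in the same way.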