Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2705 |
Symbol | |
ID | 7873447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2932222 |
End bp | 2933967 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643699628 |
Product | von Willebrand factor type A (vWA) domain-containing protein |
Protein accession | YP_002889684 |
Protein GI | 237653370 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCG GCGATGCAGG TTCCGGCGTG GCGCCCGCCG TCGCGCAGCA ATCGGCCTCC GCGGCGGCCG ACAGCGCGCC CGCGCCGCTG CCGCCGGACC GCGCCCGGGC CGCCGAGGCT CCACCCGTCC CCCGGCTCTT CGAGGCGCTC GAAGCCGTGC TCGCGCCGCT CGATCGCCTG CCGCGTCCGC TGTGGCTCGG TGGCATGACC CACTCCGGCG GCGAACTCGC GGGCCGTCTC GCCGCGCTCG AAGCCTGGCG GGCTGCGCTG CTCGCCGGCG GCCTGCCCGG GTCTTGCGCG GCCTGGCCCG AGGCGGAGGT GCAGGAGGCC GTGCGCGAGG TCTTCCGGGG CCTGGGCTTG CCCGCCTACT GTGCAGACCA GCCCGCGCTG GTCGATACCG TGCTGCAAGG CCTGCTCTTC CACCTCGACC TGATCGTCGA CTACCGCGAC CGCGGCGATA CCGAGGCCGC GGCGCAGGCG AGGGCGCTCG ATTCCTTCGC TGCCGACTGG GCCGAGCGCT GCGGCGAGAT CGACGAGCTG GTCGGCGCCT TCGGGGATCT CGGCGATCTG CTCGACAACG CCCGCTGGGA TGCGCTGCGC GGCCTGCTGC GCAGCACGGA CTGGCGCGAG GTGCTGCGCA TCCGCGCGCT GATCGAGGGC CTGCCCGAGC TCGCCCGCAT CCTGCGCGCA CTCGGCCGCG CCTGCCCCAC CGACGAGGAC GCCGAATCCA GCCGGGCATT GCACGCCGTG GTCGAGCACA CCGAGATACA GCGCAGCGTC TCGCACCGGG TGCGGGTGCC CGACCTGCCC GGCGAAACGC GCGGCGTGCA GCGCTCGGGC CGGATCGCGC GCATGCTGCC GGCCGAGGCG ACGCTGCTCG GCCATCCGCG CCTGCGCTTG GTCTGGCATG CGCGCCGCGC CGAGCGCACG CTGCTCGCCT ACGAGGACGA CGACCACCTG CAGGAGGACT GCCTGCGCCC GGCGCCGGTG CTGCGCCCCA GTCAGCGCCC TGCGCCGGCA CGGCGCCTGG AGCAGGGGCC GATGCTGGTG TGCGTGGATA CCTCGGGCTC GATGCAGGGG GGCGCCGAGG CGGTGGCCAA GGCGGTCGTG CTGGAGGCGG TGCGCTGCGC TCACGCCCGG CGCCGCGCCT GCCGGGTGTA TGCCTTCGGC GGGCCCGACG AGGTGGTCGA GATGGAGCTC GGCGTCGATG TCGATGGCGT CGGCCGGCTC GCCCGCTTCC TCGGCCAGGG CTTCGGCGGC GGCACCGACA TCTGCGCCCC GCTCGAGCGT GCGCTCGCCC GCCTCGACGA AGCCGGCTGG CAGCTCGCGG ACCTGCTGAT CGCGTCCGAT GGCGAATTCG GCGCCACCCC GGCGCTCGCC GCCCGCGTCG AGGCCGCCCG CCGCGAGCGC GGCCTGCGCG TGCAGGGCAT CCTGATCGGC GACCGCGAGA CCATCGGCCT GCTCGAACTC GCCGACGACA TCCACTGGGT GCGCGACTGG CGGCGCTATG GAGGCGGTAC GGACAAGCCG GGTGGGGCCG ACGCGGACAA GCGGCGTGGC GATGGCGCCG GCGCGAACGC CCCGACCCTT GCTGCCGGCG GTGGCTCGCC GGTGCATTCC AGCCACCTCA CCGCGGACTA TTTCCCCGGT GCCCTGCGCA CCCCCGAGAA CCGTGCCGCC ACCGTCACAC CCGAGGCGGC TGCCGCTGCC ATCCGTGCCG GCCGCCACCG CGGCGACCGG TTGTAG
|
Protein sequence | MATGDAGSGV APAVAQQSAS AAADSAPAPL PPDRARAAEA PPVPRLFEAL EAVLAPLDRL PRPLWLGGMT HSGGELAGRL AALEAWRAAL LAGGLPGSCA AWPEAEVQEA VREVFRGLGL PAYCADQPAL VDTVLQGLLF HLDLIVDYRD RGDTEAAAQA RALDSFAADW AERCGEIDEL VGAFGDLGDL LDNARWDALR GLLRSTDWRE VLRIRALIEG LPELARILRA LGRACPTDED AESSRALHAV VEHTEIQRSV SHRVRVPDLP GETRGVQRSG RIARMLPAEA TLLGHPRLRL VWHARRAERT LLAYEDDDHL QEDCLRPAPV LRPSQRPAPA RRLEQGPMLV CVDTSGSMQG GAEAVAKAVV LEAVRCAHAR RRACRVYAFG GPDEVVEMEL GVDVDGVGRL ARFLGQGFGG GTDICAPLER ALARLDEAGW QLADLLIASD GEFGATPALA ARVEAARRER GLRVQGILIG DRETIGLLEL ADDIHWVRDW RRYGGGTDKP GGADADKRRG DGAGANAPTL AAGGGSPVHS SHLTADYFPG ALRTPENRAA TVTPEAAAAA IRAGRHRGDR L
|
| |