Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2129 |
Symbol | |
ID | 7085399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2403353 |
End bp | 2405854 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699148 |
Product | von Willebrand factor type A |
Protein accession | YP_002355765 |
Protein GI | 217970531 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACAAC AACAATCGGC ACGACTCGAG ACGAGGGCGG GTGAGGCACT GACGCTGCAG GGCGTGCGGT TCACCGGCAC CTTGCGCGGC ACGCTCTTCG AAGCGGATCT CGAACAGCGC TTTGCCAACC CCTTCGAGCG CCATGTCGAG CTCGTCTACA GCTTCCCGCT GCCCTGGGCG GCGGTGCTCC TCGGGGTGGA GGTGCGGATC GGCGAGCGCC GCCTGTCCGG TGCGGTCATC GAGAAGAGGC AAGCCGAGCA GGGCTACGAG GACGCGCTGG CCGAGGGCAA CACCACCATC CTGCTCGAGC AGAACGTCGA TGGCAGCTAC ACGCTGAACC TGGGCAACCT CGCACCCGGG GAGACCTGTG TGGTACGGCT GCGTTACGCC CAGGTGCTGC AGTTCGAGCA GCATGGCCTG CGCCTGGTGG TGCCCACGGT GATCGCCCCG CGCTACGGCG ACCCGGTGGC CGATGCCGGC CTGAAGCCAC ATCAGCTGGT CGAGCACGAC CTGATGGCCG TCTATCCGTT CGAGCTCACC TTGCGCATCG AAGGCGAGCT CGCGCGGGCG CGCATCGGCT CGCCGAGCCA TCCGTTGTCG ACGCGGCTCG AAGGAGAGGG GGAGACGGCG GCGATGCTCG TATCGCTCGG TCGCGGCGGG GCACTCGATC GCGACTTCAT CCTGGTGCTC GACGAGGTTG CGCAGGATTC GCTTGCCGTG TGCGCGCACG ACACCCTCGA TGAGGGCGCG GTGAATGTGC TGGCGAGCTT CTGCCCGCGT GTGCCGGCGG CGGCCCATCC GCTCGCGGTG AAGATCCTGG TGGACTGCTC GGGCTCGATG CAGGGCGACA GTATCGCCGC TGCGCGACGT GCGCTGCAGG CCATCGTCGC CGGCCTGCGC GAGGGCGAGC GGTTCTCGCT GTCACGCTTC GGCAGCACTG TCGAGCATCG CTCGCGCGCG CTGTGGCGCA CGAGCCCCGC TACCCGGCTG GCCGGGCAGC GCTGGGCGGC ACAGTTGCAA GCCGACCTGG GCGGCACGGA GATGGAGAAG GCGCTGGACT CCACGCTGGC CCTGGCCGGA GACGCATCGG TAAGCCCCGG TGCTGGGGAA GGTGCCGCGC CGGTCGATCT GCTCCTCATC ACCGATGGCC AGATCCACGC CATCGACCGA ACGGTGGCGA AGGCGCGCGC GCTGGGTCAC CGGGTGTTCG TCGTCGGTAT CGGCAGCGCC CCTGCCGAGG GTGTGCTGCG CCGCCTGGCC GAGGAGAGCG GCGGGGCCTG TGACTTCGTC GCCCCCGGAG AGACTGTGGA GCCTGCCGTG CTGCGCATGT TTGCGCGCCT GCGCTCGCAG CGCATGGCGT CCCTGGCGCT TGCGTGGCCG GCTGGTGCGA AGCCCCTGTG GATGAGCGCG CTACCCGGCT CGGTGTTCGA TGGTGACGCG GTGACCGTGT GGGCACGCTT CGCGCAGGTG CCGGATGGGA CGGTGCGTCT GATCGGCCGG CGCACGCACG CCTCGGTGCC CGAGTCGCTG GGCGAAGCCT GCCTGACGGC TGCGGAGCAT GATTCGGCGC TCAGCCGGAT GGCCGTGGCC GCGCAGATCG AAACCCTGCT CGCGACCGAG GGCGCCCAAT CGCGTCAGGC GCTGGAACTG GCCGTCGCGT ATCAGCTGGT CAGCCCGCTG ACGCATTTCC TGCTCGTCGA GACGCGTGCC GAAGCCGACA AGCCGGCGGA CATGCCCGAT CTCGTGAAGG TGCCCTCGAT GCTGCCCGCC GGCTTTGGCG GGCTCGGCAG TCTCGACTTC TGCATCGACC CCTGCATGGC GCCGCTCACT GTTAATGAGG CACCGGAGCA CTACGGCTCG CCGGCTGCGG CAAGCGGTGC GTATCTCGAC TTCGACGATC CCTTCGATGC GCCCGTGGTG CTGCGCTCGG GTCGACGTGC GGATCACGGC GATACGCCCA ACAAGGCTGG GACCTATGAC ATCCCGGCTT TCCTGCGCCG AAGCTCGAAC CAGGACGCCG GGCAGCCCCC CCGGGACGAT CCGCGTTACT GGTGTGCCGA ACCGCATTAC ACAGGTCTCA CTCCCCTCGG TCTGACACAG TGGCTCCGCA GTCACCCGCA GGCCGAATGG CCGCAACGTT ATGCCGAGTT GCGCCGACTC GGTGTCGGTA CGGCAGTGCT CGACTGGCTC GAGTTCGTGT TGGCTGAAGG GGAGGGTGAG TCGCTGGTGG TCGCCTGCTT CGTTCAGGTG ATGGCGCAGC GCGACCTGTA CGAAGCGCTG CTGTCGGACA CCGGCGCGCT TGGTAGGCTT AAGGCCTTGG CACAACGCGT GGCACCGGGC GCGGCGCTGA AGGTCAGCCA GGATGACCCT GCGGCTGCGT CGATCCTTGC TCGCCTGCAG GTGTTTGTAA GTACGCTGCG GGCAGAGCGC TGGCCCGACT GCGTCTTTGC GCTCCAGGAC GGGGCGTCGG CGCTCGAACA GTCGGGTGTC GGCGTTGGAT AG
|
Protein sequence | MIQQQSARLE TRAGEALTLQ GVRFTGTLRG TLFEADLEQR FANPFERHVE LVYSFPLPWA AVLLGVEVRI GERRLSGAVI EKRQAEQGYE DALAEGNTTI LLEQNVDGSY TLNLGNLAPG ETCVVRLRYA QVLQFEQHGL RLVVPTVIAP RYGDPVADAG LKPHQLVEHD LMAVYPFELT LRIEGELARA RIGSPSHPLS TRLEGEGETA AMLVSLGRGG ALDRDFILVL DEVAQDSLAV CAHDTLDEGA VNVLASFCPR VPAAAHPLAV KILVDCSGSM QGDSIAAARR ALQAIVAGLR EGERFSLSRF GSTVEHRSRA LWRTSPATRL AGQRWAAQLQ ADLGGTEMEK ALDSTLALAG DASVSPGAGE GAAPVDLLLI TDGQIHAIDR TVAKARALGH RVFVVGIGSA PAEGVLRRLA EESGGACDFV APGETVEPAV LRMFARLRSQ RMASLALAWP AGAKPLWMSA LPGSVFDGDA VTVWARFAQV PDGTVRLIGR RTHASVPESL GEACLTAAEH DSALSRMAVA AQIETLLATE GAQSRQALEL AVAYQLVSPL THFLLVETRA EADKPADMPD LVKVPSMLPA GFGGLGSLDF CIDPCMAPLT VNEAPEHYGS PAAASGAYLD FDDPFDAPVV LRSGRRADHG DTPNKAGTYD IPAFLRRSSN QDAGQPPRDD PRYWCAEPHY TGLTPLGLTQ WLRSHPQAEW PQRYAELRRL GVGTAVLDWL EFVLAEGEGE SLVVACFVQV MAQRDLYEAL LSDTGALGRL KALAQRVAPG AALKVSQDDP AAASILARLQ VFVSTLRAER WPDCVFALQD GASALEQSGV GVG
|
| |