Gene Information |
Locus tag | Tmz1t_1710 |
Symbol | |
ID | 7084130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1923175 |
End bp | 1925697 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698731 |
Product | von Willebrand factor type A |
Protein accession | YP_002355361 |
Protein GI | 217970127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
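The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. Neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1923175
end_bp = 1925697
gene_length_bp = 2523
protein_length_aa = 840

# Assumption: Start bp and End bp are 1-based and both inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(span_bp, 3)
translated_codons = codons - 1

print(span_bp == gene_length_bp)               # coordinates agree with Gene Length
print(remainder == 0)                          # length is a whole number of codons
print(translated_codons == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions the three fields are mutually consistent: 2523 bp is 841 codons, and 840 of them are translated.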
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAAC AATCGGCACG ACTTGAGACG CGCGCGGGTG AGGCACTGAC GCTGCAGGGC GTGCGGTTCA CCGGCACCTT GCGCGGCACG CTCTTCGAAG CGGAGCTCGA GCAGCGCTTT GCCAACCCCT TCGAGCGCCA TGTCGAGCTC GTCTACAGCT TCCCGCTGCC GTGGGCGGCG GTGCTCCTCG GGGTGGAGGT GCGGATTGGC GAGCGCTGCC TGTCCGGAGC GGTCATCGAG AAGAAGCAGG CCGAGCAGGG CTACGAGGAC GCGCTGGCCG AGGGCAACAC CGCCATCCTG CTCGAGCAGA ACTTCGATGG CAGCTACACG CTGAACCTGG GCAACCTCGC ACGCGGAGAA ACCTGTGTGC TGCGGCTGCG TTACGCCCAG GTGCTGCAGT TCGAGCAGCA TGGCCTGCGC CTGGTGGTGC CCACGGTGAT CGCCCCGCGC TACGGCGATT CGGTGGCCGA TGCCGGCCTG AAGCCGCATC AGGTGGTCGA GCACGACCTG ATGGCGGTCC ATCCGCTTGA GCTCACCTTG CGCATCGAAG GCGAGCTCGC GCGGGCGCGC ATCGGCTCGC CGAGTCATCC GTTGTCGATG CGGCTCGAAG GAGAGGGGGA GACGGCGGCG ATGCTCGTAT CGCTCGGTCG CGGCGGGGCA CTCGATCGCG ACTTCATCCT GGTGCTCGAC GAGGTTGCGC AGGATTCGCT TGCCGTGTGC GCGCAAGACA CCCTCGATGA GGGCGCGGTG AATGTGCTGG CGAGCTTCTG CCCGCGTGTG CCGGCAGCGG CCCATCCGCT CGCGGTCAAG ATCCTCGTGG ATTGCTCCGG CTCGATGCAG GGCGACAGCA TTGCCGCTGC GCGCCGTGCG CTGCAGGCCA TCATCGCCGG CCTGCGCGAG GGCGAGCGCT TCTCGCTGTC ACGCTTTGGC AGCACGGTCG AGCATCGCTC GCGTGCCTTG TGGCGCACCA GCGCCGCCAC CCGCCAGGCG GGACAGCGCT GGGCGATGCA GTTGCAGGCC GATCTCGGTG GCACCGAGAT GGAGAACGCG CTGGCATCCA CGCTGGCCCT GGCCGGAGAT GCCGAGCCGA GCCCTGGGAC GGAGGAGGGG GCCGCAGCGG TCGATCTACT GTTGATCACC GACGGCCAGA TCCACGCCAT CGATCGCACG GTGAAACGGG CGCGAGCGCT GGGGAATCGG ATCTTCGTGG TCGGCATTGG CAGCGCGCCG GCCGAGGGTG TGCTGCGGCG GCTGGCTGAC GAGACGGGCG GCGCCTGCGA CTTCGTCGCG CCCGGCGAGG CCGTGGAGCC GGCGGTGCTG CGCATGTTCG CGCGTTTGCG CTCGCAGCGC ATGGACGCCC TGCAACTGGT GTGGCCGACG GGTGCCGAGC CGGTGTGGAT GAGCCCGCTG CCCGGCTCGG TCTTCGATGG CGACGCAGTC ACGGTGTGGG CGCGCTTTGC GCAGGTGCCG CATGGGTGGG TGCCGGATCA GCCGGTGCGT CTGGTCGGCC GGATGGCACA GGGGGACGCG CCCGTAGCGC TGGGTGAGGC ATCGCTCAGT GCCGTCGAGC CGCATGCCGT GCTGGGCCGC ATGGCGGTGG CCGCGCAGGT CGAGCAGCTC CTCGCCGGCG AAGGGGCGCA GTCACTCCGG GCACCGGAGG CGCTCGCGCT GGCCGTCGCG TATCAACTGG TCAGTCCGCT GACGCACTTC CTGCTGGTCG AGGCGCGTGC CGAAGCCGAC AAGCCCGAGG ACATGCCCGA TCTCGTGAAG GTCCCCGCGA TGCTGCCTGC AGGCTTTGGC 
GGGCTGGGAA GCGTCGATCT GCTTTACATC GAGCCCAACA TGGCGCCGCT CACGGTGAAT GACACGCCCA CCGCGTATGG CGCGCCGGCA GCGGCGGGCT TCGGGTCGTT TGACATGAAC GCGCTCGATG CGCCGGCCGT GATGCGCTCG GGTGCGCGCG TGCTGCGCAG CGAAAGGCGA ACACAGGCCT CCACCTACGA TATCCCGGCA TTCCTGCGAC GCGGCGCTGA CCGGGCAGCC GATGAGCCGC CGCGCGACGA TCCGCGTTAC TGGTCTGCCG AGCCTCATTA CAGCGGCCTC ACACCGCTGG GCCTGACCCA CTGGCTGCGC AGTCATACGC AGGCCGAATG GCCGCGCACG TATGCGGCGC TGCACAGGAT CGGTGTCGGT ACGGCGGTGC TCGACTGGCT GGAGTTCGTC CTGGCCGACG GGGAGGACGA GGCGCAGGTT GTGGCCTGCT TCGTCGAGGC GATGGCTCAG CGCGAATTGC ATGACGCGCT GCGCTCGACC AACGGCATGC TCGGACGGCT CAAGGCCCTG ACGCAGCGGG TGAGGCCGGC CGCTGCGCCT GAGGCCCGAC AGGTTGAGGT GGCGTCTGAG TCGCTCGCGG CGCGCCTGCA GGTGTTCGTC AGCACGCTCC AGGCGGAAAC CTGGCCCGAC TGCGTCTTTG CGCTCCGGGA CGCGGCCTCG GCGCTCGAAC GGTCGGGTGT CGGCGTTGAA TAG
|
Protein sequence | MQQQSARLET RAGEALTLQG VRFTGTLRGT LFEAELEQRF ANPFERHVEL VYSFPLPWAA VLLGVEVRIG ERCLSGAVIE KKQAEQGYED ALAEGNTAIL LEQNFDGSYT LNLGNLARGE TCVLRLRYAQ VLQFEQHGLR LVVPTVIAPR YGDSVADAGL KPHQVVEHDL MAVHPLELTL RIEGELARAR IGSPSHPLSM RLEGEGETAA MLVSLGRGGA LDRDFILVLD EVAQDSLAVC AQDTLDEGAV NVLASFCPRV PAAAHPLAVK ILVDCSGSMQ GDSIAAARRA LQAIIAGLRE GERFSLSRFG STVEHRSRAL WRTSAATRQA GQRWAMQLQA DLGGTEMENA LASTLALAGD AEPSPGTEEG AAAVDLLLIT DGQIHAIDRT VKRARALGNR IFVVGIGSAP AEGVLRRLAD ETGGACDFVA PGEAVEPAVL RMFARLRSQR MDALQLVWPT GAEPVWMSPL PGSVFDGDAV TVWARFAQVP HGWVPDQPVR LVGRMAQGDA PVALGEASLS AVEPHAVLGR MAVAAQVEQL LAGEGAQSLR APEALALAVA YQLVSPLTHF LLVEARAEAD KPEDMPDLVK VPAMLPAGFG GLGSVDLLYI EPNMAPLTVN DTPTAYGAPA AAGFGSFDMN ALDAPAVMRS GARVLRSERR TQASTYDIPA FLRRGADRAA DEPPRDDPRY WSAEPHYSGL TPLGLTHWLR SHTQAEWPRT YAALHRIGVG TAVLDWLEFV LADGEDEAQV VACFVEAMAQ RELHDALRST NGMLGRLKAL TQRVRPAAAP EARQVEVASE SLAARLQVFV STLQAETWPD CVFALRDAAS ALERSGVGVE
|
| |
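The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (the record lists Strand as +) and uses the codon-to-amino-acid assignments of translation table 11, which match those of the standard code. The function name `translate` and the table-building helper are illustrative, not part of the database. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon assignments for translation table 11, in NCBI's TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases,
    # so all whitespace is removed first.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCAGCAAC AATCGGCACG ACTTGAGACG"))  # MQQQSARLET
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GGCGTTGAA TAG"))  # GVE
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full Gene sequence field to `translate` would be the natural way to check the remaining residues.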