Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2605 |
Symbol | |
ID | 7873346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2809315 |
End bp | 2811228 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699528 |
Product | von Willebrand factor type A |
Protein accession | YP_002889584 |
Protein GI | 237653270 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4548] Nitric oxide reductase activation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAA CCGTAGGCCT GCTCTGGCAC CGCCTCATCA CCCGCGCCGC CGGCGGACAC TTCCCCAAGG CGGCGGTCCG CCTGCGCGAC ATCGAGAAGA CCGCGGGCGT GTTCTTCCGC GCCCTCGGCG GCGACCCCGG CCTGCGCCTG GCGGCCGCCA CCACCGACGA GCATGGCGCG CGGCGCAGCC TGCTGCAGCG CATCGCCGGC GCCGACGAAC GCGTCGCGCG CGCGCGCATG GACGTGTCCA CGCTGCGCCT GCCGCCCGAG CTCGACGTGC TGCCCGAGCG CGGCCTCAAC CGCGACCTCT ACCTCTGGCT CGCCGCCGTG GCCGCCGCGC CCCCCGCGGA GGATGGCGAC GGCCACCCCG CGCTCGGCCC CGTGCCCGAC GAGCCGATCG ACGAGGCCGC CACCCGCGAG CTGCGCGAGA ACCAGGCCGC CACCCTGCGC GCCCTGGCGC GCTGGCCGGG CATCCAGGCA CGCTACCGCC GCCTGGTCGA CGCGATGATC GCCCAGCGCC CCAGGCTCGA CAAGCTGCCG CCCGCCGAGG CCGAACGCGA GAAGCTGATC CGCCTCGCCC TGCACGCCCC CGGCAGCGTC GCCGCCCTGC CCCGCCTGCC CGCCAAGGCG CGCGCCACCC AGCCCGTGCT GCTGTGGCTG AGCGAGTTCC ACGCCAGCGC CGCGGGCGGC AGCGGCGGCG CCGGCGCGGG CGGCGAGCTC GGCGAGGCCG GCGCCAGCCG TCCCGGCAAG GACAGCAGCC GCCAGGCCCA CCGTGTCGAG CGCCAGGAGA AAATGCCGGA AAAGCACGGC ATGATCATCC CCTTCCGCGC CGAGAGCCTG CTGTCGGTGG CGGAGTTCAT CAAGGTCCAG CGCAGCACCG ACGACGAGCC CGACGACAAC GCCGCCGACG CCGCCGCCAA CCTCGACCAC CTGTCGATCA CCCGCGACGG CGAGCGCGTC GCCTCCAAGG TGCGCTTCGA CCTCGACCTG CCCTCGGCTG CCGAGGACGA CGTCGTGCTC GGCGACGGCA TCCCGCTGCC CGAGTGGGAC TACCGCAAGA ACCTGCTGCT CGAGGACCAC GTCCGCCTCG CCGAGCTCAC CCCCTCGATC CACGACCCGC GCGCCGCCCC CTGCGCCCTG CCCGAGCACC TGCGCCGCAC TGCGCGCCGG CTGCACCGCC AGTTCGCCGC GCTCACTCCG GGCAGACGCT GGCTCAAGGC GCAGGTCGAC GGCACCGAGC TCGACCTCGA CGCGGTGGTG CGCGCCGCCA CCGACCGCGC CACCGGCCAC CATCCGTCCG ATCAGCTCTA CCTCTCGCTC GAGAAGCGCG AGCGCGACCT CGCCTGCCTG GCGCTGGCCG ATCTGTCGCT GTCGACCGAT TCCTGGGTCT CCTCCGAGGC GCGTGTGATC GACGTCATCC GCGACTCGCT GCTGCTCTTC GGCGAAGCCC TGCTCGCCAC CGGCGACAGC TTCGCGCTAT GCGGCTTCTC CTCGGTCAAG CGCAGCAACG TGCGCTTCCA CCGCCTCAAG GACTTCGACC AGCGCTTCGA CGACCGCGCG CGCGGCCGCA TCATGGCGAT CAAGCCGGGC TACTACACCC GCCTGGGCGC GGCGATCCGC CACGCCACCA CGATCCTCGA CCGCCAGCGC GCCGCACGCC GCATCCTGCT GATCCTGTCC GACGGCAAAC CCAACGACCT CGATCTCTAC GACGGCCGCT ACGGCATCGA GGACACCCGC GTCGCCGTCG TCGAGGCACG CAACCGCGGT GTGGTGCCGT TCTGCGTTAC CATCGACCGC GAAGGCGCAA GCTACCTGCC GCACCTCTTC GGTCCGGCCG GCTACGCGGT GATCCGCCAA CCCGACGAGT TGCCCGCACG CCTGCCCATG TTCTATGCCC AGCTCACCCG CTGA
|
Protein sequence | MEETVGLLWH RLITRAAGGH FPKAAVRLRD IEKTAGVFFR ALGGDPGLRL AAATTDEHGA RRSLLQRIAG ADERVARARM DVSTLRLPPE LDVLPERGLN RDLYLWLAAV AAAPPAEDGD GHPALGPVPD EPIDEAATRE LRENQAATLR ALARWPGIQA RYRRLVDAMI AQRPRLDKLP PAEAEREKLI RLALHAPGSV AALPRLPAKA RATQPVLLWL SEFHASAAGG SGGAGAGGEL GEAGASRPGK DSSRQAHRVE RQEKMPEKHG MIIPFRAESL LSVAEFIKVQ RSTDDEPDDN AADAAANLDH LSITRDGERV ASKVRFDLDL PSAAEDDVVL GDGIPLPEWD YRKNLLLEDH VRLAELTPSI HDPRAAPCAL PEHLRRTARR LHRQFAALTP GRRWLKAQVD GTELDLDAVV RAATDRATGH HPSDQLYLSL EKRERDLACL ALADLSLSTD SWVSSEARVI DVIRDSLLLF GEALLATGDS FALCGFSSVK RSNVRFHRLK DFDQRFDDRA RGRIMAIKPG YYTRLGAAIR HATTILDRQR AARRILLILS DGKPNDLDLY DGRYGIEDTR VAVVEARNRG VVPFCVTIDR EGASYLPHLF GPAGYAVIRQ PDELPARLPM FYAQLTR
|
| |