Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0249 |
Symbol | |
ID | 7084370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 283268 |
End bp | 285259 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697291 |
Product | UvrD/REP helicase |
Protein accession | YP_002353940 |
Protein GI | 217968706 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases |
TIGRFAM ID | [TIGR01074] ATP-dependent DNA helicase Rep |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000268408 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCC TCCTCAACGC GCCCCAGCGC GAAGCGATCC GCTATCTCGA CGGCCCCTGC CTGGTGCTGG CCGGCGCCGG CAGCGGCAAG ACGCGGGTGA TCACGCACAA GATCGCGCAC CTGATCAACG AGTGCGGCAT CAGCCCCAAC AACATCGCTG CCATCACCTT CACCAACAAG GCCGCCAAGG AGATGCAGGA GCGCGTCGCC CACATCATGG GCGGGCGCGT GCCGGGTGGG CTCACGGTGT GCACCTTCCA CGCGCTCGGC GTGCGCATCG TCCGCCAGGA GGCCAAGCAC TGCGGCTTGA AGCCGCAGTT CTCCATCCTC GACGCCTCCG ATACCGTGCA GATCGTCTCG GACGTGGCCG GCGACAGCGA CAAGGGCATC GCCAAGCAGA TGCAGTGGCA GATCTCGTCG TGGAAGAACG CGATGATCAC GCCCGAGGAG GCCGCCCAGC TCGCCGACAA CGAGATCGCC TCGGTCGCCG CCAGGCTCTA CAAGGAATAC GAGCGCACCC TGCGCGCCTA CCAGGCGGTG GATTTCGACG ACCTCATCGC GCTGCCGGTG CGCCTCTTCG ACGAGCATCC GGAAGTGCGC GAGCGCTGGC AGAACAAGCT GCGCTACCTG CTGGTGGACG AGTACCAGGA CACCAACCGC GCCCAGTACC GCTTGCTGAA GCTGCTGTCC GGCGTGCGCG GCGCCTTCAC CGCGGTGGGC GACGACGACC AGGCGATCTA CGCCTGGCGG GGCGCCGACG TCGAGAACCT GAAGCTGCTG CAGCAGGACT ACCCCAAGCT CAAGGTCATC AAGCTCGAGC AGAACTACCG CTCCTCGCGC CGCATCCTCG AGGCCGCCAA CACCGTCATC GCCAACAACG AGAAGCTCTT CGACAAGCGC CTGTGGTCCG AGCACGGCCA GGGCGAACAG ATCGTCGTCA CCAACTGCCG TGACGCCGAG CACGAGGCCG AGTGGGTGGC GACCAAGATC ACCGCGCACA AGTTCGAGCA CCGCACCCGC TTCAAGGACT ACGCCATCCT TTACCGCGGC AACCACCAGG CCCGCCTCAT CGAGCAGCAG CTGCGCAACC ACCGCATCCC CTACGTGATG TCGGGCGGGC AGAGCTTCTT CGACAAGGCC GAGATCCGCG ACCTCATCGC CTGGCTGCGC CTGCTGGTGA ACGAGGACGA CGACCTCGCC TTCATCCGCG CCATCACCAC GCCGCGCCGC GGCATCGGCG CCGCCACCAT CGAGGCCCTC GGCGCCTACG CCGGCCATCG CCACAGCAGC CTGTTTGCCG CGGTGTTCGA GGAGGGGCTC ACCCAGCATC TGAATGCCAA GCAGCTGCAG GGCGTGCAGG AGTTCGCCGC CTACATCAAC CGCCTGCAGT ACCGCGCCCC GCGCGAGCCC GCCGCGCAGC TACTCGAAGA CCTGCTCGGC GCGATCCGCT ACGAGGCCTG GCTGTTCGAG CACTGCGACA CGCGCGAGGC CGAATCCAAG TGGAGCAACG TGCGCGACTT CGTCGGCTGG CTCGGCCGCA AGGGCGAGGA GGACGGCAAG AACCTGCTCG AGCTCACCCA GACCATCGCG CTGATGTCCA TGCTCGACAA GGAAGACCCC GACTTCGACG GCGTGCAGAT GGCCACCCTG CACGCCTCCA AGGGCCTGGA GTTTCCGCAT GTGTTCCTGG TCGGCGTCGA GGAGGGCCTG CTGCCGCACC AGAGCAGCAT CGACGAGGAC AAGGTCGAGG AAGAGCGCCG CCTGATGTAC GTCGGCATCA CCCGCGCGCA ACGCAGCCTC AACCTGACCT GGTGCGAGCG GCGCAAGTCG GGCAAGGAGT TCCGCACCTG CGAGCCGAGC CGCTTCATCG CCGAGATGGG CGGCGACATC AAGATGAACG ACCGCAAGAC CGCGCAGCCG GTGACGAAGG AAGAAGGCAA GGCGCGGCTG GCGAACCTGA TGGCGATGTT CGAGAACCGC GACAAGGCTT GA
|
Protein sequence | MSALLNAPQR EAIRYLDGPC LVLAGAGSGK TRVITHKIAH LINECGISPN NIAAITFTNK AAKEMQERVA HIMGGRVPGG LTVCTFHALG VRIVRQEAKH CGLKPQFSIL DASDTVQIVS DVAGDSDKGI AKQMQWQISS WKNAMITPEE AAQLADNEIA SVAARLYKEY ERTLRAYQAV DFDDLIALPV RLFDEHPEVR ERWQNKLRYL LVDEYQDTNR AQYRLLKLLS GVRGAFTAVG DDDQAIYAWR GADVENLKLL QQDYPKLKVI KLEQNYRSSR RILEAANTVI ANNEKLFDKR LWSEHGQGEQ IVVTNCRDAE HEAEWVATKI TAHKFEHRTR FKDYAILYRG NHQARLIEQQ LRNHRIPYVM SGGQSFFDKA EIRDLIAWLR LLVNEDDDLA FIRAITTPRR GIGAATIEAL GAYAGHRHSS LFAAVFEEGL TQHLNAKQLQ GVQEFAAYIN RLQYRAPREP AAQLLEDLLG AIRYEAWLFE HCDTREAESK WSNVRDFVGW LGRKGEEDGK NLLELTQTIA LMSMLDKEDP DFDGVQMATL HASKGLEFPH VFLVGVEEGL LPHQSSIDED KVEEERRLMY VGITRAQRSL NLTWCERRKS GKEFRTCEPS RFIAEMGGDI KMNDRKTAQP VTKEEGKARL ANLMAMFENR DKA
|
| |