Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2485 |
Symbol | |
ID | 7874168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2683961 |
End bp | 2686000 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699407 |
Product | hypothetical protein |
Protein accession | YP_002889464 |
Protein GI | 237653150 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGGA CCAAGGACAA GGAACCTTCC GAGCAACGGC GTCAGCGCAC TTGCGCAACG ATGGAGGAGC ATCGTCGGCT CGCCCATCTC TATCCCGAGT ATCGCCGCAG GCGCCGGGAG ATCGAGCTCG AGACGCGCCA GTTCATCGCG CGCTACGCCG CCGAGGGCCT GCGCACCGGG ATCGTGCGCA TCCCGGTGGT GGTGCATGTG GTGTGGAACA CCGCGGCGCA GAACGTAAGC GATGCGCAGA TCCAGTCGCA GATCGACGTG CTGAACGCCG ATTTCCGCCG CACGAACGCG GACGCCGCGA GCGTGCCGGC CCACTTCGCC GGCGTGGCCG CGGACGCCCG CATCGAGTTC GCGCTCGCCG TGCGCGACCC CAACTGCGGC GCGACCACCG GCATCACCCG CACGAACACG GCGACGACCG GCTTCACCCG GGCGACCCGC AACAACGTCA AGTCGGCGGC CACCGGCGGG GCCGATCCGT GGCCTTCGGA CCGCTACCTC AACATGTGGG TGGCCAACTT CACCGACGAC CTGCTCGGCT TCGCCACCTT CCCCGGCGGC CCGGCGGCGC TCGACGGCTT CGTCGTCGAT ACGCAGGCCT TCGGGACGAT GGGCACGGCC GCCGCGCCTT TCAACCTCGG CCGTACCGCC ACCCACGAGA TCGGCCACTG GCTCAACCTG ATCCACATCT GGGGCGACGA CACCGCGCAG CCCGACCAGT GCAGCGGCAC CGACATGTGC GCCGACACAC CCAACCAGGC GGACGAGACC TACGGCAACC CCGCGGGCAT CCGCATCTCC TGCGGCAACG GGCCCAACGG CGACATGTAC ATGAACTACA TGGACTACAC CGACGACGTC GGCATGTTCA TGTTCTCCCA GGACCAGGTG ACCCGCATGA ACGCGACGCT GATGGTCGCG CGCACCGGCA TCCTGGCCTC CGACGGGCTG GTGCCGGTGG GCGGCGGCTC GCCCGCGCCC GACCTGTGGA TGCAGGACAA CGCCGACGAC GTCGGCGCCG AGCCGGACGC GAGCACCAAC CCGATGTGGA TCAGCGACGA CATCTGGGTG CGCAACGTCG CCGACGGCCT CACCAACCAG GACCACCAGA ACCCCAACGG CGAGCAGACG AACTACGTCT ATGTGCGCGT GCGCAACCGC GGCTGTGCCG GCGCCGCGGC GCAGAGCGGC ACGCTCAAGC TGTATTGGGC GAAGGCGTCG AGCTCGCTGT CGTGGCCGGC GCCCTGGGAC GGCAGCGTGG CCAGCCCGGC GCTGATGGGC GGGCTGGTCG GCAGCCAGGC GGTGAGCGTG AACGGCGGCG ACACCGAGAT CGTGGAGTTC GCGTGGACGC CGCCGGATCC TTCGGACTAT GCGGCCTTCG GCGCGGACAA GGCGCACTTC TGCCTGCTCG CCCGCATCGA GACCTCCGCC ACCGCGCCCT TCGGCATGAG CTTCGCCGAG ACTGCCAACC TCTACGCCAA CGTGCAGAAC AACAACAACA TCGTGTGGAA GAACATCTCC ATCGTCGACA CCGACGGCGA CGGGGCGCGC CACGCGGACG TGGTGATCGG GCGCTTCACG CGCGAACGGC GCGCCACGCG GCTGCTGTTC CGCACGCCGA AGCGGCGCGG CTTCTCGCTG TTCGACTGGG GCCACCTGAT GGTCGAGTTC CGCGGCGAGG CCCTGCTCGA ATGGGTGAAG GAGGGCGTGA AGGGCGATGG CTTCGAGCGC CTGCAGGACG GGCGCCTGTT CATCGCGCGT GCGGGCGCGG AGGTCGTCGG CCCGCCGCTC AAGCCCGGCA GCTTCGGCAC CGTGCATGTG CAGTTCGTGC CCGATGGCCG GCGCCCGACC GGCGCCCAGG TCTTCGAGCT CGATCTGATC GAGCTCGACG CCAAGGGGCG CGCCGTCGGT GGGCAGCGCC TGCTGCTGAA GACCGGCCGT GCGCCCGCGA AGCCTTGCTG CCGACGTGAA GCGGGAAGCT TCGACGGCGT CAACTGGACG CCGGACTCGA CCTGCCGATG CGGCTGCTGA
|
Protein sequence | MARTKDKEPS EQRRQRTCAT MEEHRRLAHL YPEYRRRRRE IELETRQFIA RYAAEGLRTG IVRIPVVVHV VWNTAAQNVS DAQIQSQIDV LNADFRRTNA DAASVPAHFA GVAADARIEF ALAVRDPNCG ATTGITRTNT ATTGFTRATR NNVKSAATGG ADPWPSDRYL NMWVANFTDD LLGFATFPGG PAALDGFVVD TQAFGTMGTA AAPFNLGRTA THEIGHWLNL IHIWGDDTAQ PDQCSGTDMC ADTPNQADET YGNPAGIRIS CGNGPNGDMY MNYMDYTDDV GMFMFSQDQV TRMNATLMVA RTGILASDGL VPVGGGSPAP DLWMQDNADD VGAEPDASTN PMWISDDIWV RNVADGLTNQ DHQNPNGEQT NYVYVRVRNR GCAGAAAQSG TLKLYWAKAS SSLSWPAPWD GSVASPALMG GLVGSQAVSV NGGDTEIVEF AWTPPDPSDY AAFGADKAHF CLLARIETSA TAPFGMSFAE TANLYANVQN NNNIVWKNIS IVDTDGDGAR HADVVIGRFT RERRATRLLF RTPKRRGFSL FDWGHLMVEF RGEALLEWVK EGVKGDGFER LQDGRLFIAR AGAEVVGPPL KPGSFGTVHV QFVPDGRRPT GAQVFELDLI ELDAKGRAVG GQRLLLKTGR APAKPCCRRE AGSFDGVNWT PDSTCRCGC
|
| |