Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1866 |
Symbol | |
ID | 7084289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2104897 |
End bp | 2106789 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698889 |
Product | hypothetical protein |
Protein accession | YP_002355514 |
Protein GI | 217970280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0316432 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACA TCCTCAAGCG CTTCACCACC GACGCGAGCG CATTCACCGA ACTCAAGTTC GTGGACGCGA AGGGCACGGC AGCCGCGGTC GATCTCGCCG ACGCGACCCG GGGCGAGGCC ACGATCTCCG CCGGCATGCT GGCCGCCCTC GCCGACCTGC AGCCGAAGGC CGCGCCCGAC GAGCCGCTCA CCGAGCCGGT CTTCAGCAAG GCAACGAGCA CAGGCAGGGG AATGCTGGCG CATGTCGAGC GGGTGCACGA CAGCTTCCTC AACGTGCGCC GCGACCTCCT CGAGCTGCGC GCCAGGGCCA ACGCTCGCCA GAATCAGGCG AAATCGCTCA TCGACCGCGT GTGCGACTTC CTCGACGACT ATCTCAAGCC GGAAGAGCTG CGCCGGCTCG GCGTCATCGA CGACGAGGCC CAGGCACGTC CCTTCCGCCT GCTCGTCGCG CCCGATCTGC ACAACGCCGA GGGCGTGCTG CTGCGCCCCC GACTCGACCT CGCCCTTCCC CGCCCGGGCG AGGACGACAT CGAAGGCGTC AAGCACAACG ACGAATCGGC GCTCTTCCAT GCCGGTAAGG TCGTTGCCGA CGACATCCGC CAGGTCGAGG CGCTGCGCGG CGCCCTGACC ACGCTGCGCC GCCGCCATCA GGAGGCGCTC GAGGACCTGC GCACCCGCCT GGTCGCACTC GAGGCCGAGC TACCCGGCGA GCTTCGCCGG CTCGACGTCC TGGAGCGCGA ACGCACCGAG ACCCTGGACG ACTACGCGGT TGCACAACGC CTGCTGGCCG AACACTGGCG CGAGGTCGAA GCCGCCCACG CCGAGCGCCG GCGAATCATC GAGGCTCATC AAGGCCTCTT CTACGTGAAG GTGCGCGAGA CCCCGCTCGG CCGCAGCCTG CCCGACCCGC TCGAGCTGCG CCCGAGCAGC CCCGACGAGC TCGTGCCCGG CTGCGCAGGA CGCGACACCG CGCTGCCGGC TGCGCTCGCG CCTTTCATCG AGGCGGTGTA CGACATCCCG GCGGCCGATT GGGCACACCT GCGGCCACTC GGCCACCTGC TGCCGGGGCG CACGATCCTC GCCGGCCTGG TCGAGATGCG CCGCCAGAAG CTCGCACTGC GCCTGAACCG CCCCCCGGAC GCCGGCCTCG CCTTGCTGTC CGGCCTGGTG CAGCAGAACA GGGCCCTGGT GCGCGACATC GCCGCGCGCC CCTTCAGCGC CGGCGCACTC GGCGAGTTGC AGCGCCAGGC AGGCGCGATC CTCGCCCTCG ACGACCTGCT CGCGATCCCC TCGCCACAGC TGCGCGACCC GGCGCGTGCC CTGCACCAGC GCCTGGATAC CGCCGCCGGC TGCCTGCTCG AGCGTCTGCG CACGATCTCT CCCTCGATCC GGCTGAACTG GGCGAGCGCG GCCGAGGCTG ACCGGCTCGC CGTCGAGGCC CCCGAGCGCT GGCCCGGCCT CGCCGAGGCG GAGGAGCGCG ACTTCAACGG CGTGCGCACG CTGGTCGAGC TCGTCGCCTG GTGGTTCCGC CAGCTCGATG CCGACGCCTC CGCCGCCGCG CACGGCGCGA TGCGCAACCT GGTGCGCGCC TGCCTGCTGC TCGCGGCCAG CGACGACCCG CAACAGCTCG TGCAGGGGCG CCTGCAGAGC ATCCCGGGGC GCTTCCGCCT CGGCGAGGCG CTGCGCCTGA AGCTCGACCG CGAGGCCGCG CCCGGCACCC TGCTCCAGCT CTTCGACGAC GACCAGCGCG TGATCGCGAC CCTGCGTGTC GACGACCATG ACGACCAGGG CACCGTCGCC TCGGTCGCCT CCATCCTCGA CCCCGAGCTC GAACGCAACC CCGGCGCCGT GCTCGCCACG GGACTGCACA TCAGCGGAGT CGATCGAGGC TGA
|
Protein sequence | MSDILKRFTT DASAFTELKF VDAKGTAAAV DLADATRGEA TISAGMLAAL ADLQPKAAPD EPLTEPVFSK ATSTGRGMLA HVERVHDSFL NVRRDLLELR ARANARQNQA KSLIDRVCDF LDDYLKPEEL RRLGVIDDEA QARPFRLLVA PDLHNAEGVL LRPRLDLALP RPGEDDIEGV KHNDESALFH AGKVVADDIR QVEALRGALT TLRRRHQEAL EDLRTRLVAL EAELPGELRR LDVLERERTE TLDDYAVAQR LLAEHWREVE AAHAERRRII EAHQGLFYVK VRETPLGRSL PDPLELRPSS PDELVPGCAG RDTALPAALA PFIEAVYDIP AADWAHLRPL GHLLPGRTIL AGLVEMRRQK LALRLNRPPD AGLALLSGLV QQNRALVRDI AARPFSAGAL GELQRQAGAI LALDDLLAIP SPQLRDPARA LHQRLDTAAG CLLERLRTIS PSIRLNWASA AEADRLAVEA PERWPGLAEA EERDFNGVRT LVELVAWWFR QLDADASAAA HGAMRNLVRA CLLLAASDDP QQLVQGRLQS IPGRFRLGEA LRLKLDREAA PGTLLQLFDD DQRVIATLRV DDHDDQGTVA SVASILDPEL ERNPGAVLAT GLHISGVDRG
|
| |