Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1862 |
Symbol | |
ID | 7084285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2095527 |
End bp | 2098787 |
Gene Length | 3261 bp |
Protein Length | 1086 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698885 |
Product | hypothetical protein |
Protein accession | YP_002355510 |
Protein GI | 217970276 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0997362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGCCT ACGCACGTCC CCCCGCACGC GTGAGCCAGC CCGGCGAGAA CATCGAGCGC GAGGCCGAGG CGCACGCCGC CGCCGTCGCC CCGACGAACT CGCCCCATCC GACGCCGGAC TCGGCCCCCG CGCCAGTACC GACCCGCAGC ACCGAAGACA GCGCGGGCGC GTCCTCGGCT GCGAACGTCG GCTCCACCCC CCCACCCGCA TCCGCCTCCG CCTCCGCCGA CGCGAACTCG CCCTCCGCGC GCCTTCCCGC CTATGCCGCC GGCCGTGTCG AAGCGCGCCG CGGGCGCGGC GATCCGCTGC CGGCCGATGC CCGCGCCCCG CTCGAGGACC ACTTCGGTCG CGACTTCGCC GACGTGCGCC TCCATTCGGA CGCCGAAGCG GCGCGCCTCA CCGCTGCGCT CGGCGCACGC GCGTTCACGT CGGGGCGCGA CATCTACCTC GCACCGGGCA CCGTCGCCTC CGACACCGAG GAGGGGCGTC ACCTCCTCGC GCACGAACTC GCCCACGTCG TGCAGCAGGA TGGCCAGGCC GCGGCGACAC TCGCGCGCGC GATCGAACCG CCCCCTCCGG CCCGCTACCA CGAGACCGAG CGCAGCGCTG CCGCCCTCGC CGCACTGGAT CGCCTCGAGA TCCCCGCGGT GAAGGCGCGC CACCTGCCGC TCTACAGCGG CCTCGCGCAG GCGCACCAGC TCAAGCGCGT GCGTGGCTAC GGACGCGAGG GCGCCGACCA GCGCTCGGTA TGGAGCCGCC AGGTCGAGGT CGGCGCCGAC GCCGTGCGCG AACGCCTGAC CGAACGCGGC ATCCCGGTTC CCGCCGACCC CAGCGGCCGC GTACGCCTGA AACTGCATCG CGGCGCACGT ACGACGTCGA AACGCCTGTC CGAGCTGCAG ACCCTGCTGC GCATCCCCGA ATGGGACCGC GAGGGCCGCA GGCGCGATTT CCAGGTCGAC CACATCGTCG AACTGCAGGT CTCCGGCCAG ATGGGCTCGG GGGTGGGCAA CAGCGTCGAG AACATGGAGC TCCTCGACCA GCCCTCGAAC TCGAGCTCGG GCGGCACGAT CCGCAGCAGC ATCTATCGCA AGCTCGACGG CTTCCTCGAT ACCCTGGAGC CTCGACCCGA CCGAGCAAGC TTCCTGCGCG GACACGACCT CGTCTTCACC AGCGTCGCAG CAACCGGCGC GGGCTCGGCG GCGACGGGCT CGGCGTGGTG GACCAGGGCC GAGATCGAGC AGGCCCACAC CCTCGGCACC GCGAGCGCCC CCACCGCCGC CGAAGACGGC GACCGCGCCG GCAGCGCCGA GGAGTTCATC CTCAGCGCCG CCCCCGGCGG CATCGCCGTG GGCCGCTTCC GCCACAGCCC CGGTGCCGCG CCCACGATCG GCGCCACCCA GGCGCGCGCG CTCGCCGGCC TGCGCATCAC GGCGATCGCG CTCACCGACC TTACCGGCAC GCCCGACGGC CCCGTCGGTA CGCTCTCGGC AGAATGGGAC CTGCCCGCCG ACTGGCAACC CGCCAATCCT GCGATCACGA TCTCGCTCCA GGGCGATGGC GAGTACCGCG GCTACCCGTC GGCGCTTCCC GGCCTCGATC TCGAGTACCG CCATCTCAGC CCGGTGAGTT TCACCCGCAT CTCCACCGAA GACGGCGAGC TGTACGCCGA GGGCACGCTC ACGCCCTCGA TCCCCATCCT CGCAGCTCCG CTGACGGTGC AACTGCGCGG ACGCGAGCTC GGCTTCGCGC TCGACTACGG TCCGGAACAG GTCAGCCTGC CGATCCCGCG CACGACGATC GACGACGCCT GGGTCAGCGT CTTCTATTCG ACCACCCGCG GACTGGGCGT GGGAGGCGAC ATCCTGTATT CGATCGAGGG GGCAGGCAGC GGCGAGCTCG GCGCCTCGGT CTCGACCGGC GGGGGCGTCG CCTTCGAGGG CGGCTTCACT TTCGATCCCG CCCTCTTCGA CCGCGCTCGC GTCCGCGCCT GGTGGCGTGA TGGTCGCCTG GGCGCCGAAG GCACGATCGG CATCGATACC CCGGACAAGA TCCGTGGCAT CCGCAGCGCG ACCGCCTCGG TCCGTGTCGA TGAAGGGCGC TGGTCGTTCA ACGGCAGTGC CGAACTCTCC GTGCCCGGTC TCAGCCAGGC CTCGATCGCG ATCCGCCAGG GCGAGGGCGG ACTCGAGCTC GCGGGCGACG TCGCCCTCGC CACCAACCCC GCCATCCGCT CCGGCACCCT GCACGTCGAA TGCGCGCAGA CCGACGGCGA GTGGAAGGTC GCGGCGAGCG GCACCGCTCA GCCTGCGATC CCCGGCGTCG ACGCGGAGCT CGCCGTCACC TATGCGGACG GCGCCTTCGA CGCGCGCTTC TCCGGCGCCT TCCGCCGCGG CATGCTCTCC GGCCAGCTCA GCGTCGGCGC CACCAACCGC GCCGTCGCTG CGGACGGGAG CCCCGGCGGC CCGCCCAGCG CTCCGGATGC GCCCATCGTG GTCTACGGCA GCGGCTCCGC GACCGTGCGG ATCGCCCCCT GGCTGCAGGG CAGCGCGGGC CTGCGGGTCG CTCCCGACGG CGAGCTCACC GTGTCCGGCG AGATCGCCCT GCCCGATTCG CTGGAGATCT TCTCCCGGCT GGAGTACGAC AAGCGCCTGT TCGGCATGTC GACGCAGATC CCCATCGTCC CCGGCGTGGT CGCCGAGGTC GGCGGCAATC TCAGTGCCAA CGCCAGCGTC GGCCCCGGGG CACTGGACCG GCTCGCTGTC CGCATCGAAT ACAACCCCGC ACACGAGGAT GACACCCACG TCACCGGCGA GGCCCACCTC GAGGTTCCGG CGCAGGCAGG CCTGCGGCTC GGCGCGCGTG CCGGCGTCGG CCTCGGCATC ACCGGCGCAA GTGCCACCGG GGGCCTCGAG ATCGGCGGCG CCCTCGGCAT CGCAGGGGCT GCCGAGGCTG GCGTGCGCAT CGACTGGATG CCCTCGCGCG GCCTCGAGAT CGACGCCGAG GCCGCGCTCC ACGCCCAGCC GCGCTTCCGC TTCGACGTTT CCGGCTACGT CGCCGTCACC GTGCTCGGCG CCAGCCTCTA CGACGAGCGC ATCGAGCTCG CCGCCTATGA GCTCGGCTCT GGCCTCGAGT TCGGCGTCCG CTTCCCCGTC ACCTACCGTG AGGGAGAACC CTTCGACCTC TCCCTCGACG ACCTCGAGTT CCAGGTCCCC GAGGTGGATC CGGCGGCGAT GATCAGGCAG CTGGGGGAGA CGATCTTCTG A
|
Protein sequence | MPAYARPPAR VSQPGENIER EAEAHAAAVA PTNSPHPTPD SAPAPVPTRS TEDSAGASSA ANVGSTPPPA SASASADANS PSARLPAYAA GRVEARRGRG DPLPADARAP LEDHFGRDFA DVRLHSDAEA ARLTAALGAR AFTSGRDIYL APGTVASDTE EGRHLLAHEL AHVVQQDGQA AATLARAIEP PPPARYHETE RSAAALAALD RLEIPAVKAR HLPLYSGLAQ AHQLKRVRGY GREGADQRSV WSRQVEVGAD AVRERLTERG IPVPADPSGR VRLKLHRGAR TTSKRLSELQ TLLRIPEWDR EGRRRDFQVD HIVELQVSGQ MGSGVGNSVE NMELLDQPSN SSSGGTIRSS IYRKLDGFLD TLEPRPDRAS FLRGHDLVFT SVAATGAGSA ATGSAWWTRA EIEQAHTLGT ASAPTAAEDG DRAGSAEEFI LSAAPGGIAV GRFRHSPGAA PTIGATQARA LAGLRITAIA LTDLTGTPDG PVGTLSAEWD LPADWQPANP AITISLQGDG EYRGYPSALP GLDLEYRHLS PVSFTRISTE DGELYAEGTL TPSIPILAAP LTVQLRGREL GFALDYGPEQ VSLPIPRTTI DDAWVSVFYS TTRGLGVGGD ILYSIEGAGS GELGASVSTG GGVAFEGGFT FDPALFDRAR VRAWWRDGRL GAEGTIGIDT PDKIRGIRSA TASVRVDEGR WSFNGSAELS VPGLSQASIA IRQGEGGLEL AGDVALATNP AIRSGTLHVE CAQTDGEWKV AASGTAQPAI PGVDAELAVT YADGAFDARF SGAFRRGMLS GQLSVGATNR AVAADGSPGG PPSAPDAPIV VYGSGSATVR IAPWLQGSAG LRVAPDGELT VSGEIALPDS LEIFSRLEYD KRLFGMSTQI PIVPGVVAEV GGNLSANASV GPGALDRLAV RIEYNPAHED DTHVTGEAHL EVPAQAGLRL GARAGVGLGI TGASATGGLE IGGALGIAGA AEAGVRIDWM PSRGLEIDAE AALHAQPRFR FDVSGYVAVT VLGASLYDER IELAAYELGS GLEFGVRFPV TYREGEPFDL SLDDLEFQVP EVDPAAMIRQ LGETIF
|
| |