Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3203 |
Symbol | |
ID | 7874424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3491191 |
End bp | 3495126 |
Gene Length | 3936 bp |
Protein Length | 1311 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700132 |
Product | phosphoribosylformylglycinamidine synthase |
Protein accession | YP_002890175 |
Protein GI | 237653861 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain |
TIGRFAM ID | [TIGR01735] phosphoribosylformylglycinamidine synthase, single chain form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCGA TCCTCAAGCT GCGTGGCGCC CCGGCGCTAT CCTCCTCCCG TCTGGAACGT CTTTCCCGCG CCGTCGGCGA GGTGTTGCCC AAGCTGGCCG GACTGGCCGC GGAGCACTGG TACTTCGTCG AAGTCTCGGC TGCGCTGGAC GAGGCGGAGC GTGCGCGTCT GGTGGATCTG CTCGACGCCG GGCCGGCAAG CACGGCAGCC CCCACCGGCA CGCTGCGCCT GGTGGTGCCG CGCTTGGGCA CGATCTCGCC GTGGTCGTCC AAGGCCACCG ACATCGCGCA CCAGTGCGGT TTCGATAAGA TCGTGCGCAT CGAGCGCGGC ATCGCCTATT CGATCGACGC GCGCGGGGTG GACGGCAACA CCGCGCTGGC GGCGTTGCTG CACGACCGCA TGACCGAGTC GGTGCTCGAC AGCATGCAGG CGGCCGAGGC GCTCTTCCAC CACTACCAGC CGCAGCCGCT CACGACCGTG GACATCCTCG CCGGTGGTCG CGCCGCGCTC GAGCGTGCCA ACGGCGAGCT CGGCCTCGCG CTGTCCGAAG ACGAGATCGA CTACCTGGTG GAGAACTTCA GCCGCATGGG GCGCAACCCC ACCGACGTCG AGCTGATGAT GTTCGCCCAG GCCAACTCCG AGCACTGCCG GCACAAGATC TTCAACGCCG ACTGGGTGAT CGACGCGCGG CCGATGGAGA AGTCGCTGTT CGGCATGATC AAGGACACGC ACAAGGCCCA CCCCGAGGGC ACGGTGGTGG CCTACTCGGA CAACGCCTCG GTGATCGAGG GCGCGCGCAT CGCCCGCCTG TACCCGGACT CGGACGGCGC GTTCCGCTAC CACGACGAGG ACACCCACAT CCTCGCGAAG GTCGAGACCC ACAACCACCC GACCGCGATC TCGCCCTTCC CGGGCGCGGC CACCGGCGCC GGCGGCGAGA TCCGCGACGA GGGTGCGACC GGCCGCGGCT CTCGTCCCAA GGCCGGCCTG GCCGGCTTCA CGGTTTCCAA CCTCAACATC CCCGACTTTG GCCAGCCCTG GGAGAAGCCC TACGGCAAGC CCGAGCGCAT TGCCTCCGCG CTCGACATCA TGATCGAGGG CCCGATCGGC GCCGCCGCCT TCAACAACGA GTTCGGCCGC CCCAACCTCG CCGGCTACTT CCGCACCTTC GAGCAGGCAG TGCAGGGTGA GGTGCGCGGC TACCACAAGC CGATCATGAT CGCCGGCGGC TTGGGTAGCA TCCAGGCTCG GCAGGCCGAG AAACCCACCT TCCCGCCGGG CACGCTGCTG ATCCAGCTTG GCGGCCCGGG CATGCTCATC GGCCTGGGTG GCGGCGCGGC CTCGTCGATG GCGACCGGCA CCAACACCGC CGACCTCGAC TTCGCCTCGG TGCAGCGCGG CAACCCCGAG ATCCAGCGCC GCTGCCAGGA GGTCATCGAC GCGTGCTGGC AGCAGGGCGA GAACAATCCG ATCATCGCCA TCCACGACGT CGGTGCCGGC GGCCTGTCGA ACGCGATGCC CGAGCTCGCC GATCACGCCG GCCTCGGCGC CCACTTCGAG CTGCGCGAGG TGCACATCGA AGAGCCGGGC ATGAGTCCGC GCGAGATCTG GTCGAACGAA TCGCAGGAGC GCTACGTGCT CGCGATCGCG CCCGAGAGCC TGCCGATGTT CCAGGCCTTC TGCGAGCGCG AGCGCTGCCC CTTCGCGGTG CTTGGCACCG CCACCGCCGA TGGCCACCTC ACCGTGTCCG ATCGCCACTT CGGCAACAAG CCGGTGGACA TGGACATGAA GGTGCTGCTC GGCAAGCCGC CGAAGATGAC GCGCAACGTC TCGCGGCGCG CGGTGCATCT GCCGCCTTTC GACACCACCG GCTTCGATCT CCAGGAGGCT GGGATGCGCG TGCTGCGCGT GCCCGCGGTG GCCAGCAAGA GCTTCCTGAT CACCATCGGC GATCGCTCGG TCGGCGGCCT CACCGCGCGC GACCAGTTCG TCGGGCCTTG GCAGGTGCCG GTGGCCGACG TCGCGGTCAC CGCGATGAGC TTCCAGGGCT ACCGCGGAGA AGCCTTCGCG ATGGGCGAGC GCACGCCGCT GGCCTGCGTC GATGCGCCGG CATCGGGCCG CATGGCGATC GGTGAGGCAA TCACCAACAT CGCCGCCGCC GACATCGAGA AGCTCGGCGA CGTCAAGCTC TCCGCCAACT GGATGGCTGC CGCCGGCCAC CGCGGCGAGG ACGCGCGTCT TTACGACACC GTAAAGGCCG TGTCCGAGTT CTGCGTCAGC GCCGGCCTGT CGATCCCGGT CGGCAAGGAC TCGCTGTCGA TGCGCACTGC CTGGCGCGAT GGCGAAGAGG ACAAGCAGGT CGTCGCTCCG CTGTCGCTGA TCGCCACCGC CTTCGCCCCC GTGCAGGACA TCCGCAGCAC GCTCACGCCG CAACTGCAGC TGCATGAAGG CGTGGAGACC GAGCTGCTGC TGATCGACCT CGGCAACGGC AAGAACCGCC TCGGCGGCTC GGCCTTCGCC CAGGCTTACG AGTCAGTCGG CGAGCATGCG CCGGATGTCG ATCCCGCCCA GCTCGCAGCG TTCTTCGAAA CCGTGCAGCG ACTGCGCCGC GACGGCCTGC TGCTGGCCTA TCACGACCGT TCCGATGGCG GCCTCTTCGC CACCGTGTGC GAGATGGCCT TCGCCGCGAA GTGCGGCCTG TCGCTGATCC TCGATACCGT TTGCTACGAC CCCTACATGA TGGACGTCGA TGGTCTGGAG AAGAAGCCGG ACACGCTCAA GGGCCGTTTC GACGACCGCC TCTTCGCCGG TCTCTTCGCC GAGGAACTCG GCGCGGTGAT CCAGATCCGC CGCGACGACC GCTCCCGCAT CACCGAGGTG CTGCGCGCGG CCAGGCTCGC TTACCACTTC ATCGGCGAGC CCAACGACAA GGATGAGATC CGCTTCCGCC GCAACGCCAA ACTCGTGTTT GCCGCCAGCC GGGTCGAGCT GCTGCAGGCG TGGTCGGAGA CCAGCTATCG CGTCGCCAAG CTGCGCGACG ATCCCGAGTC GGTGCAGCAG GAATTCGACG CGCTCGCCGA CGCCGGCAAC CCCGGCCTGT CGGTGGCGCT GTCCTTCGAC GTCAAGGAGG ACGTGGCGGC GCCCTTCATC GCCAGCACTG CGCGTCCCAA GGTCGTCGTG CTGCGCGAGC AGGGTGTCAA CTCCCAGTTC GAGATGGCGG CAGCCTTCGA GCGTGCCGGC TTCACCCCGG TCGACGTGCA TATGTCGGAC CTGCAGGCCG GTCGCATCGA CCTGGCCGAC TTCCATGGCC TCGCAGCTTG CGGCGGCTTC TCCTATGGCG ACGTGCTCGG CGCCGGGCAG GGCTGGGCCA AGTCCATCCT GTTCAACCCC CGCCTGCGCG ACGAGTTCGC GGCTTTCTTC GGGCGCAGCG ACACCTTCGC CCTCGGGGTG TGCAACGGCT GTCAGATGAT GGCCCACCTC GCGCCGATCA TCCCCGGCGC CGAAGCCTGG CCCACCTTCC ACCGCAACCG CAGTGAACAG TTCGAGGCCC GCTTCGTGAT GGTGGAGGTC GCAGACAGCC CCTCCATCCT GCTCCAGGGC ATGGCCGGCA GCCGCATGCC GATCGTGGTC AGCCATGGCG AGGGCAGGGC GGTGTTCCAC AGCGAGGCCG ACCGCGAGCG CGCCCTCCTG GCGCTGCGCT ACGTGGACAA CCACGGCAAT CCGGCACAGA CCTATCCGGC CAACCCCAAC GGCTCGCCGG AGGGTGTGAC GGGCTTCACC ACTGCCGACG GGCGCTTCAC GATCATGATG CCGCACCCCG AGCGCACCGC GCGGACCCTG CAGATGTCGT GGGCGCCGCA GTGGCTGGTC ACGGACAGTC CCGACGCCTC GCCCTGGCTG CGCATGTTCC GCAATGCCCG CTTGTGGCTC GGCTGA
|
Protein sequence | MTSILKLRGA PALSSSRLER LSRAVGEVLP KLAGLAAEHW YFVEVSAALD EAERARLVDL LDAGPASTAA PTGTLRLVVP RLGTISPWSS KATDIAHQCG FDKIVRIERG IAYSIDARGV DGNTALAALL HDRMTESVLD SMQAAEALFH HYQPQPLTTV DILAGGRAAL ERANGELGLA LSEDEIDYLV ENFSRMGRNP TDVELMMFAQ ANSEHCRHKI FNADWVIDAR PMEKSLFGMI KDTHKAHPEG TVVAYSDNAS VIEGARIARL YPDSDGAFRY HDEDTHILAK VETHNHPTAI SPFPGAATGA GGEIRDEGAT GRGSRPKAGL AGFTVSNLNI PDFGQPWEKP YGKPERIASA LDIMIEGPIG AAAFNNEFGR PNLAGYFRTF EQAVQGEVRG YHKPIMIAGG LGSIQARQAE KPTFPPGTLL IQLGGPGMLI GLGGGAASSM ATGTNTADLD FASVQRGNPE IQRRCQEVID ACWQQGENNP IIAIHDVGAG GLSNAMPELA DHAGLGAHFE LREVHIEEPG MSPREIWSNE SQERYVLAIA PESLPMFQAF CERERCPFAV LGTATADGHL TVSDRHFGNK PVDMDMKVLL GKPPKMTRNV SRRAVHLPPF DTTGFDLQEA GMRVLRVPAV ASKSFLITIG DRSVGGLTAR DQFVGPWQVP VADVAVTAMS FQGYRGEAFA MGERTPLACV DAPASGRMAI GEAITNIAAA DIEKLGDVKL SANWMAAAGH RGEDARLYDT VKAVSEFCVS AGLSIPVGKD SLSMRTAWRD GEEDKQVVAP LSLIATAFAP VQDIRSTLTP QLQLHEGVET ELLLIDLGNG KNRLGGSAFA QAYESVGEHA PDVDPAQLAA FFETVQRLRR DGLLLAYHDR SDGGLFATVC EMAFAAKCGL SLILDTVCYD PYMMDVDGLE KKPDTLKGRF DDRLFAGLFA EELGAVIQIR RDDRSRITEV LRAARLAYHF IGEPNDKDEI RFRRNAKLVF AASRVELLQA WSETSYRVAK LRDDPESVQQ EFDALADAGN PGLSVALSFD VKEDVAAPFI ASTARPKVVV LREQGVNSQF EMAAAFERAG FTPVDVHMSD LQAGRIDLAD FHGLAACGGF SYGDVLGAGQ GWAKSILFNP RLRDEFAAFF GRSDTFALGV CNGCQMMAHL APIIPGAEAW PTFHRNRSEQ FEARFVMVEV ADSPSILLQG MAGSRMPIVV SHGEGRAVFH SEADRERALL ALRYVDNHGN PAQTYPANPN GSPEGVTGFT TADGRFTIMM PHPERTARTL QMSWAPQWLV TDSPDASPWL RMFRNARLWL G
|
| |