Gene Information |
Locus tag | Tmz1t_4034 |
Symbol | |
ID | 7873679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4430891 |
End bp | 4432549 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643700970 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_002890993 |
Protein GI | 237654679 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
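The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption; the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4430891
end_bp = 4432549

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last codon is the stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1659 0 552
```

Under these assumptions the result agrees with the listed gene length (1659 bp) and protein length (552 aa).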
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCGG GACATGTGCC TCTACCGCAG GATTTTGCAG AGCTTTATCG CAAGAAAGGC TACTGGGAAA ACATCGGCGT CACGGAGATG GTCAAGCGCA CGGTCGAGCG GCATTCAAAT AAAGTGGCCC TAGTGGCGGG GATAAATCGC ATTACCTACC AATTCTTGTA TCAGGCCATT GAGCGTCTGG CCTGTAGATT TACTGAAGCA GGTTTGCGCC CGCTTGATCG GGTTATCCTC CAGTTACCCA ATACGCCTGA ACTCATCTAT GCATATCTTG CACTCACGCG CATCGGCGCG ATTCCCGTCA TGGCGTTGCG GGCACATCGC GAAACCGAAA TCAGGCATTT CATAAATGCT TCCGGGGCAG TGGCGTATGT GATCCCCGAT CAGGTCAATC ATTTCGACTA TCGCGTGATG GCCGATACAT TACGCGAAGA GTGCTCTAGC CTTCGGTACA TCTTCGTCGC TGGCACCCCG AGAGATGGGC AAGCCTCACT CAGTGAAATG ATCAATGAGG AACGAACCGC GCCTCAAATC GCGGCTGCAC TGGGTTCAAT CAGGGTCGAT CCGACCGACG TCGCGACTAT GCTCCTGTCG GGCGGCACCA CCTCAATCTC CAAGTTGATC CCACGCACGC ATAACGACTA CGTGCTAAAT GCCCGCCTCT GCGGAGAAGC CGCAAGCCTC GACGAACAGA CTGTTTTCAT GGCAATCCTG CCATTGGGCC ATAACTACAA TCTAGCCTGC CCTGGCTTTC TTGGTACGTT CTATTACGGC GGCACGGTGG TGCTTGCTCC CTCCGGTGAC GCCAATGAAG TGTTCTCCTT GGTCGAAAGG GAAAAGGTCA CGGTCATCGC GGCGGCGGTT CCGCTAATTA CAACTTGGCT GAACTCGAAT ACATACGAAC GCTATGACCT GTCCTCGCTC CGAATCATTC AGAATGGCGG CGCACGACTT GCGCCAGAAC TGCGTAAGCG AATCTTGCAA CAACTTGGCT GTTTCCCACA GGAGATCTAC GGTACTGCTG AGGGGTTGAT CAACATGACG CGCCTTGGAG ACGCGGAGCG GGCGGTTATC GAAAGCTCAG GCTCTCCGGT GTGCGAGGAC GACGAGATCA AGGTCGTTGA CGAATTCGGA GACGAAGTTC CTGACGGCGA AGCCGGTGAA CTTGCCACTC GTGGTCCCTA CACGATCCGC GGCTACTTCA ATGCACCGGA AATAAACAGC GCCGCTTTTA CAAAGGATGG CTTTTACCTG ATGGGCGATA TCGTTCGCAA AGAGGGCCGT TTTGTTTTTG CCGAAGGCCG GAAAAAGGAT TTTATCAACC GCGGCGGCGA AAAAATAAGT TGCGAGGAAG TCGAGAATTT GATTCTGCAG CACCCAAAGG TATTTCAGGT TTCGCTGGTG GCCATGCCGG ACGACACTTT CGGTGAGAAG GCTTGCGCAT TCGTTCGCCC GCGAGCCGAT GAGACCCTTG GATTCGAGGA GCTTATCCAT TTTCTTCGAT CAAAAAAAAT TGCAAGCTTC AAGCTTCCCG AGCGCCTCGA GATTATTGAA GCTTTTCCTG TAAGCCCCGT GGGTAAAATT CTAAAGCGAC AGTTGCGCGA GATTATCGCC GCAAGAATTA CAACAGAAAA AATCGCCGCC AAGAATTAA
Protein sequence | MLSGHVPLPQ DFAELYRKKG YWENIGVTEM VKRTVERHSN KVALVAGINR ITYQFLYQAI ERLACRFTEA GLRPLDRVIL QLPNTPELIY AYLALTRIGA IPVMALRAHR ETEIRHFINA SGAVAYVIPD QVNHFDYRVM ADTLREECSS LRYIFVAGTP RDGQASLSEM INEERTAPQI AAALGSIRVD PTDVATMLLS GGTTSISKLI PRTHNDYVLN ARLCGEAASL DEQTVFMAIL PLGHNYNLAC PGFLGTFYYG GTVVLAPSGD ANEVFSLVER EKVTVIAAAV PLITTWLNSN TYERYDLSSL RIIQNGGARL APELRKRILQ QLGCFPQEIY GTAEGLINMT RLGDAERAVI ESSGSPVCED DEIKVVDEFG DEVPDGEAGE LATRGPYTIR GYFNAPEINS AAFTKDGFYL MGDIVRKEGR FVFAEGRKKD FINRGGEKIS CEEVENLILQ HPKVFQVSLV AMPDDTFGEK ACAFVRPRAD ETLGFEELIH FLRSKKIASF KLPERLEIIE AFPVSPVGKI LKRQLREIIA ARITTEKIAA KN
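The protein sequence corresponds to the gene sequence read codon by codon on the + strand with translation table 11. The following is a minimal sketch of that relationship, not the database's own pipeline: it builds the codon table by hand from the amino-acid string that NCBI lists for table 11, and `translate` is a hypothetical helper name. It is applied only to the first six and last three codons of the gene sequence above.

```python
from itertools import product

# Base order T, C, A, G at each codon position, as in NCBI's genetic-code listings.
BASES = "TCAG"
# Amino acids for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence (the space-separated grouping is tolerated).
print(translate("ATGCTTTCGG GACATGTG"))  # MLSGHV
# Last three codons, ending in the TAA stop.
print(translate("AAGAATTAA"))  # KN
```

The two fragments reproduce the start (MLSGHV) and end (KN) of the listed protein sequence. Because this gene begins with ATG, the alternative start codons permitted by table 11 do not need special handling here.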