Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0760 |
Symbol | |
ID | 7084151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 843457 |
End bp | 844440 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697785 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002354427 |
Protein GI | 217969193 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCGG CTGCCCAGCC TTCGGGAAAC GCCCCGTACT CCTACACCCT GCGCCACACG CTCGATGCCG ACGAGCACGC CGCCTGCCTG ACCAACTGGC ATCAACGCTA CGACCAGCTC ACCGCGGGCG CCTTCGACGG CGTGTTCGAG GAGTTCTGCT TCGGCAGGGT GCAGCTGTTC CGCGAGGGCC TGAACCAGTC GGTGCATCAG GCTGGCGGGG CGTGGCCCGG CTCGCGCACC TTCGCCGTGC CGGTCGCGAT CGAGGGCACG GGCTGGTTCG GCGGCGAGAT GTATGACGCG CACTCCATGC TGACACTCGG AGGCGACGAC GAGCTCGACT TCCGCACCCC GCGCCGGCTC GAGATCCTCG CCTGCACCGC CGACTCGGCC GCGCTCCACG ACTACGCGCA TCAGGTGGAC CACCGCAACC TCGAGGCCGA GCTCGCCGGG CGCAAGCTGG CACCGACCAC CCCGGCGGGA ATCGCGGCGC TCGGGCAACT GCTGGCGACC ATGACGGCCA GCCTGCGGGC GACCCCCGAG CTGCTTCTGC ACCCGCAGAT GCGCAAGGCG ATGGAACAGG CGCTGTTCGC GACCCTACTC GACACGCTCT CCTGTGGCGG CGGACGGGCC GGTGCACCGT CCTGTCGCGC CCGCCAGCAG GTGGTGGCGC GCGCGCGCGC CTACATGGAG GCCCACATCG ACGAGCCGAT CACGGTCGCC GACCTGTGCA TCGAGCTCGG CATCTCGCGC CGCACCCTGC AGTACAGCTT CCAGGACGTG CTCGATCTCA ACCCGGTCAA GTTCCTGCGC GCGATCCGCC TGAATGCGGT GCGCCGCTCG CTCAAGGCCG CGGACCCCAA TGGTCGCGGC ACGGTCGCCG ACATCGCCGC ACGCTGGGGC TTCTGGCACC TGTCGCACTT CTCCGCCGAG TACAAGACGA TGTTCGGCGA GCTGCCGTCG GACACGCTCA GGCGCGCGGG CTGA
|
Protein sequence | MSPAAQPSGN APYSYTLRHT LDADEHAACL TNWHQRYDQL TAGAFDGVFE EFCFGRVQLF REGLNQSVHQ AGGAWPGSRT FAVPVAIEGT GWFGGEMYDA HSMLTLGGDD ELDFRTPRRL EILACTADSA ALHDYAHQVD HRNLEAELAG RKLAPTTPAG IAALGQLLAT MTASLRATPE LLLHPQMRKA MEQALFATLL DTLSCGGGRA GAPSCRARQQ VVARARAYME AHIDEPITVA DLCIELGISR RTLQYSFQDV LDLNPVKFLR AIRLNAVRRS LKAADPNGRG TVADIAARWG FWHLSHFSAE YKTMFGELPS DTLRRAG
|
| |