Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3372 |
Symbol | |
ID | 7873863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3685810 |
End bp | 3686937 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700309 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002890343 |
Protein GI | 237654029 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCCCCCGCC GTAGCCGCGC CACGGTCGCG CCCGGCTTCG TCACCGGCAT GCTGGCCGGG CTGCAGCGGC GCGGACTCGA TGGGGCGCCG CTGCTCGCAC GCGCCGGGAT CGACCTTGCA GAAACCGACA CGCGCATCCC AGTCGAGCGC TACGCCCTGC TCTACAACCT GTGCGTGGAG GCACTTGAAG ACGAGGCCTT CGGCCTGCTC CCGGAGCCGA TGCGCCCGGG CAGCTTCGAG TTCCTCTGCC GCGCACTGCT CGGCGCGCCC ACGCTGGGCG AGGCCCTGCT GCGCGCGATC CGCTTCCTGC GCATCGTGTT GCCCGCCTTC CGCATTGAGC TGCGGGTGGA CGGCGCGCGC GCCGAGTTGC TGCTGGACGA CGGCGGCAGC CTCGGCCCCG GGCTCGACGC CCCGGCGCGC GTGTTCGCCT ACGAATGGCT GCTGCGCCTG CTGCACGGCG TGGCGAGCTG GTTCGTCGGC CGCGGCCTGG CGCTCGACGC GGTCGCCTTC CCTTACGCAC GCCCGGCCCA TGCGGACGAC TACGCGCTCG TCTACACCGA GCATTCGAGC TTCGACGCGC CGCAGCTGGC CGCCCGCCTG CAGGCCAACC TGCTGGCGCT GCCGCTGCGC CGCGACGAGG CGGCGCTGGT CGGCTTCCTC GAGGGCGCAC CGGGAAAGAT CACCACCCTC TACCGGCGCG ACCGCGAGAT GGTCTTCCGC GTGCGCGACA TCCTGCGCGA CGCGCTGCCG CAGAACCTTT CGCTCGAAGA GGTCGCCGAG CGCCTGCACG TGTCGCCGCG CACCCTGCAC CGGCGGCTGG AGGATGAGGG CTCGGGATTC CGCAACATCA AGGAGGCCAC CCGCCGCGAC ATCGCCTATG CGCGCCTGGC CAAGACCCGC CAGCCCATCG CCCGCATCGC GGCCGAGCTC GGCTACGCCG ACCCGTCCAC CTTCTACCGC GCCTTCGTCG CCTGGAGCGG CATGTCGCCG GAGCAGTTCC GGCACCGGCT GGCGGGCAAC GACGGTCTTC CCGCGGCGTC TGCCGGTCCG GACAGGCGCC CTGCCCCAAC CCCCCGTGTC ACCGCCGGAC GGCAGCGATC ACGCGAATTC GGTCCGACCG AGGCATAG
|
Protein sequence | MPRRSRATVA PGFVTGMLAG LQRRGLDGAP LLARAGIDLA ETDTRIPVER YALLYNLCVE ALEDEAFGLL PEPMRPGSFE FLCRALLGAP TLGEALLRAI RFLRIVLPAF RIELRVDGAR AELLLDDGGS LGPGLDAPAR VFAYEWLLRL LHGVASWFVG RGLALDAVAF PYARPAHADD YALVYTEHSS FDAPQLAARL QANLLALPLR RDEAALVGFL EGAPGKITTL YRRDREMVFR VRDILRDALP QNLSLEEVAE RLHVSPRTLH RRLEDEGSGF RNIKEATRRD IAYARLAKTR QPIARIAAEL GYADPSTFYR AFVAWSGMSP EQFRHRLAGN DGLPAASAGP DRRPAPTPRV TAGRQRSREF GPTEA
|
| |