Gene Information |
Locus tag | Tmz1t_4022 |
Symbol | |
ID | 7873668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4420642 |
End bp | 4421589 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700959 |
Product | transcriptional regulator, XRE family |
Protein accession | YP_002890982 |
Protein GI | 237654668 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
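The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon while Protein Length does not. Both assumptions agree with the values in this record, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 4420642, 4421589

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 948
print(protein_length(length_bp))  # 315
```

Both results match the Gene Length (948 bp) and Protein Length (315 aa) fields.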
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCTC AAGCCAACTG CGCGCTGCCA CCGGGGTTTC AGCTCGACCA GTACCGCATC GAGCGGCAAC TGTCGCTGGG CGGCTTCTCG ATCGTCTATC TCGCGCGCGA CGCGCATGGC ATGGCCGTGG CGATCAAGGA ATACCTGCCC AACTCGCTCG CCCTGCGCAA GGAAGGCGAG ACCGAACCGC AGGTCAGCGA GGAGCATCGC CTCGCCTTCC GCTACGGCAT GAAGTGCTTC TTCGAGGAAG GCCGCTCGCT CGCCAAGCTG ATGCATCCGA ACGTGGTGCG GGTGCTCAAC TTCTTCCGCG CCAACGGCAC GGTGTACATG GTCATGCAGT TCGAGCGCGG CCGCACCCTG CACGACTACA TCCGCAAGCA CCGCGGCGAG GTCAAGGAGA TGCTGATCCG CGCGGTGTTC GCGCGCATGC TCAATGGCCT GCGCGAGGTG CACGCGCACA AGCTCATGCA CCTCGACATC AAGCCGTCCA ACATCTACCT GCGCAACGAC GGCACCCCGG TGCTGCTCGA CTTCGGCGCC GCGCGCCAGA CCCTGATGAG CGACCAGCCG GTGCTCAAGC CGATGTACAC GCCCGGATAC GCCGCGCCGG AACAATACGA GAAGGGCGCC CAGCTCGGTC CGTGGACCGA CATCTACAGC GTGGGTGCGA GCCTGCACGC CTGCTTCGTC GGCAGCCCGC CGCCGCGCGC GGACGAACGC GCGCGGGAAG ACACGCTCCA GCCCCTGGCG AGGACCCAGG CCGGGCGCTA CAGCCGGCAG CTGCTCGAGC TCGTCGACTG GTGCCTTCAC CTCGACCCGC AGCAGCGCCC GCCCAGCGTG TATGCGCTGC AGAAGGCGCT GATCCATCAA GAAAACGCCC CCGAGCGTCC GGAACACTGG TTCTCGGACT TGGGCAGCCG GCTGCGCTCC TTCATCGGCC GCAGCTGA
|
Protein sequence | MPSQANCALP PGFQLDQYRI ERQLSLGGFS IVYLARDAHG MAVAIKEYLP NSLALRKEGE TEPQVSEEHR LAFRYGMKCF FEEGRSLAKL MHPNVVRVLN FFRANGTVYM VMQFERGRTL HDYIRKHRGE VKEMLIRAVF ARMLNGLREV HAHKLMHLDI KPSNIYLRND GTPVLLDFGA ARQTLMSDQP VLKPMYTPGY AAPEQYEKGA QLGPWTDIYS VGASLHACFV GSPPPRADER AREDTLQPLA RTQAGRYSRQ LLELVDWCLH LDPQQRPPSV YALQKALIHQ ENAPERPEHW FSDLGSRLRS FIGRS
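As a rough consistency check, the gene sequence can be translated and compared with the protein sequence, and its GC fraction compared with the GC content field. The sketch below is a minimal, dependency-free version and not the pipeline that produced this record. It assumes that translation table 11 assigns the same residue to each codon as the standard code (the two differ in which codons may act as start codons), and the helper names `clean`, `gc_fraction` and `translate` are made up for illustration. Only the first 30 bases are pasted in; the full Gene sequence field can be substituted.

```python
from itertools import product

# Codon-to-residue mapping in NCBI's TCAG ordering (third base varies fastest).
# Assumption: table 11 shares this mapping with the standard genetic code.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), RESIDUES)
}


def clean(seq: str) -> str:
    """Drop the whitespace that separates the 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the Gene sequence field.
gene_prefix = "ATGCCGTCTC AAGCCAACTG CGCGCTGCCA"

print(translate(gene_prefix))  # MPSQANCALP
print(f"GC fraction of prefix: {gc_fraction(gene_prefix):.2f}")
```

The translated prefix matches the first block of the Protein sequence field. Running `gc_fraction` on the full gene sequence gives the value to compare with the GC content field.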