Gene Information |
Locus tag | Tmz1t_0400 |
Symbol | |
ID | 7084911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 459773 |
End bp | 460849 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697433 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002354076 |
Protein GI | 217968842 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
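
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(459773, 460849)
print(length)                  # 1077
print(protein_length(length))  # 358
```

Both results match the Gene Length (1077 bp) and Protein Length (358 aa) fields.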
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAC CGGCCAATCG ATTGACATTT TCTGCCACCA ATCTCTTGCA CATGAACGAG CGTCAGCACG CACCGGCGGT CACCGTCTCC ATCCGCATGG TCTGCCAGAT CCTGGATCGG CTGGGTGAGC ACAGGCCGCT GGCCCTGGAA TGCGTCGAGC GCGCCGGGAT CGCTTCCGCG CTGCTGGAAC ACGAATCGGC CCGTGTGACG GTCGAGCAGT TCGCGCTCTT CTACCGCACC CTCGCGGTCG AACTCGACGA CGAGACGCCA GGGCTGTTCT CACGCCCGCT GCGCCCTGGC ACGCTGAAGT TCCTGTGCCT TGGAATGCTC GACGCCGCCA ACCTGCGCGT CGCGCTGCAC CGCTTCTGCT GGTTTTTCCG CCTGGTGCTG GACGATCTGC ACTTCGAGCT GGGTGAGGAC GAGGGCCTGA GCCGCATCGC GCTCGTCGAG CGCGTTGCGC TCCAACCCCA CCGCCCCCTG ATCCTCGAAT TGATGCTGAT GCTGGTGCAG GGCATCGCTT CGTGGATGAT CGAGCGCAAG CTCCTGTTCG CGCGCGTGGA TCTCGCCTAT CCGGCGCCGC CGCATGCGGG TGAGTACATC AACATGTTCG CCGGCCCGGC CTGCTTCGAT CGCCCGCTCA CCGCGCTCTA CATCGAGCCG GCCTTCATGG ACGCCCCGAT CCGCCAGGAC AAGGCCGCGC TGTCGGCCTT CCTGCGCAAG GCGCCGATGG ACTGGATCCA CGTCTCCGTG AGCGAGCGGC TGTTCACGCA CCGCGTGCGC GATCTGCTCG AAGCCGGGCT GGGAAGCCCG CAGTCCGTGG AAGACGTTGC CCGTATGCTG CACATCTCGG CGCGCACGCT CGCCCGCCGG CTCGATGCGG AAGGAACGCA CTTCCAGGCG GTCAAGGACG CGCTGCGCCG CGACGTCGCC ATCGCGCGGA TCTCGCGCAC CGACGAGCCG ATCGGCAGCA TCGGCGCCGA TCTCGGCTTC GACGATCCCG CGGCTTTCAA TCGCGCCTTC AAGCAATGGA CGGGATCCCC GCCAGGCAGC TACCGCAGGG CGGGGGCGGG CCGCTGA
Protein sequence | MTEPANRLTF SATNLLHMNE RQHAPAVTVS IRMVCQILDR LGEHRPLALE CVERAGIASA LLEHESARVT VEQFALFYRT LAVELDDETP GLFSRPLRPG TLKFLCLGML DAANLRVALH RFCWFFRLVL DDLHFELGED EGLSRIALVE RVALQPHRPL ILELMLMLVQ GIASWMIERK LLFARVDLAY PAPPHAGEYI NMFAGPACFD RPLTALYIEP AFMDAPIRQD KAALSAFLRK APMDWIHVSV SERLFTHRVR DLLEAGLGSP QSVEDVARML HISARTLARR LDAEGTHFQA VKDALRRDVA IARISRTDEP IGSIGADLGF DDPAAFNRAF KQWTGSPPGS YRRAGAGR
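
The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, not the database's own pipeline: it builds the codon assignments shared by the standard code and translation table 11, strips the spacing between 10-base blocks, and translates until the first stop codon. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the gene starts with ATG. All helper names are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACAGAAC CGGCCAATCG ATTGACATTT"))  # MTEPANRLTF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the Protein sequence and GC content fields.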