Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1216 |
Symbol | |
ID | 7083876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1344720 |
End bp | 1346249 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698232 |
Product | CHAD domain containing protein |
Protein accession | YP_002354871 |
Protein GI | 217969637 |
COG category | [S] Function unknown |
COG ID | [COG3025] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCACG AAATCGAACT CAAGCTCGCG CTGCCCAGGC GCGCGCTCCC GGCGCTGCGC CGCCACCCGC TGGTGGCCGC GGCGGAGAAA TGCGGCAACG CCCTCACCCT CGACAACACC TACTACGACA CCCCGAAGCT GCAGCTCAAG GCGCGCAAGG TGGCCGTGCG CACCCGCCGC CAGGGCCGCC AGATGCTGCA GACGGTGAAA TGCGCGGCGG TATCGAGCGG CGGGCTGTCG CAGCGGCCGG AGTGGGAGAC CGCGTGGACC GGCGGCTTCG ATTTCACGCC GGTCGACGAC CCCGCCACCG CGCGTCTGCT CGAACGCCAC CGCGCAGAAC TGGTGCCGGT GTTCACCACC CGCTTCCGCC GCGAGACGCG GCGACTCGTC CCGCAGGCGG GCGTGTCCAT CCTGCTGATG ATCGACACCG GCGCGGTGCA TGTGCGCACC CCGGAGGGCG TCGAGCGCGA GGCGGAGATC TGCGAACTCG AGCTGGAACT CGAGAGCGGG CGCGCGCAGG ACCTCCTCGA CCTCGCCTGC ACGCTCGCGC AGGACCTGCC GCTGATGCCG GCCGACCTCT CCAAGGCCGA ACGCGGCTAC CGCCTTTTCC TCGACACGCC CGCGGGCGCG CTGCGCTCCG AGATCTCCAC GCTCGAGCCC GGCCAGAACG TGGTCGAGGC CTTCCAGGGC CTGGCACTGT CCTGCGTGCG CCAGTGGCAG GGCAACGCCG CGACCGCGCT CGCGCAGGGC GACCCCGACG CGATCGACCC CGACAACATC CACCAGCTGC GCGTCGCCCA TCGCCGCCTG CGCGCGCTGC TCAAGATCTT CGCACCCGCC CTGCCCGAGA CCTTCGCCGG CACCTGGAAC GCCCGCCTGC GCGACAACGC CAACCGCTTC GGCGATGCGC GCGACCTGGA TGTGTTCCAC GCCGAGCTGC TCGAGCCGGT GACCCCCGAG GGCCTCGCCG ACGCAGCCTC GATGGCGGCC CTGCTCGAGA CCGTGCGCAC GGCGCGCGCA AGCGCCCGCC ACCACGCCGG CGTCAGCCTC GACCTCGCCA CCCAGGGCCG GCTGCTGCTG GAGTTCACCG CCGCACTCCA CCGCCTGCGT GCCGACAGCC TCGCCGAAGC CGCCGACCTG CGCAGCTTCG CCCGCCTGCG CCTCGACCGG CTGCGCAAGC GCGCCCGCCG CGGCGCCCGG GCGGCGGCCA GCCTGGAGCC CACCCGCCTG CACGCGCTGC GCATCGACTT CAAGATGCTG CGCTACGGCG TCGAGTTCTT CGCGCCGCTG TTCGGCACCA GGAGCATCAC CCGCTATCTC GACGGCGTGG TGCGCGCGCA GACCACGCTC GGCTTCCTGC AGGACGTCGA CACCGCGCAC CAGCGCCTGG CCGACTGGTC GCGAACGCAG CCCGCGCTCG CCGCGGCCGC GGCCTTCGTG CTCGGCTGGC ACGCCCCGCG CTACGCCCGC CTGCGCCGCC GCGTGCTGCG CGAGTGCGAG CCGCTGCTGT GGGGCGGCAA GCCGTGGTGA
|
Protein sequence | MSHEIELKLA LPRRALPALR RHPLVAAAEK CGNALTLDNT YYDTPKLQLK ARKVAVRTRR QGRQMLQTVK CAAVSSGGLS QRPEWETAWT GGFDFTPVDD PATARLLERH RAELVPVFTT RFRRETRRLV PQAGVSILLM IDTGAVHVRT PEGVEREAEI CELELELESG RAQDLLDLAC TLAQDLPLMP ADLSKAERGY RLFLDTPAGA LRSEISTLEP GQNVVEAFQG LALSCVRQWQ GNAATALAQG DPDAIDPDNI HQLRVAHRRL RALLKIFAPA LPETFAGTWN ARLRDNANRF GDARDLDVFH AELLEPVTPE GLADAASMAA LLETVRTARA SARHHAGVSL DLATQGRLLL EFTAALHRLR ADSLAEAADL RSFARLRLDR LRKRARRGAR AAASLEPTRL HALRIDFKML RYGVEFFAPL FGTRSITRYL DGVVRAQTTL GFLQDVDTAH QRLADWSRTQ PALAAAAAFV LGWHAPRYAR LRRRVLRECE PLLWGGKPW
|
| |