Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0731 |
Symbol | |
ID | 7083960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 816024 |
End bp | 817166 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697757 |
Product | Carbonate dehydratase |
Protein accession | YP_002354399 |
Protein GI | 217969165 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0288] Carbonic anhydrase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCC GTCTTTGCGC CCTCGGCGCC AACGACACCG TCGAGCTCGA GCACGTGCGC GGCTTCTTCC GCCAGTACGC CGCCTGGCTC GGCGTGGACC TGAGCTTCCA GGGGTTCGCC GACGAGATCG CCAACCTGCC GGGGGCCTAC GGCGCCGCCG ACGGCCGCCT GTTCTACGCC GAGGTCGATG GCAAGCCGGC CGGCTGCGTG GGCATCCGCC GCTTCTCCGA GGGCGTGTGC GAGATGAAGC GGCTCTACGT CGATCCGGCC TTCCGTGGTG GCGGCGTGGG GCGCAAGCTC GCGCTCGCCG CCATCAAGGC GGCGCGTCTG TTCGGCTACC GCCGCATCCT GCTCGACACC ATCCCGTCGA TGCGGATCGC GGTCAAGCTC TACCGCGAGC TCGGCTTCAA GGAAGTGCCC GCCTGCTACC CCTCGCCGAT CGAGGGTGCG ATCTTCCTGA CCCTGGACCT GGAGAACTGG TCGGAGGACG ACGTCAGCAA CGAGAACCTC TTCCACCTCT TCGACTACAA CCAGGCGTGG TCGCGCCAGA TGCAGCAGCT CGACCCCGGC TTCTTCGGCA AGCTGGCGCA GCTTCAGGCC CCGGAGTACC TCTGGATCGG CTGTTCCGAC TCGCGCGTGC CGGCCAACCA GATCGTCGGC CTGCTGCCGG GCGAGGTCTT CGTCCACCGC AACATCGCCA ACGTGATCGT GCACACCGAC CTCAACGCGC TGGCGGTGAT CCAGTACGCG GTCGATGTAC TGCAGGTGAA GCACATCATG GTCGTTGGCC ACTACGGCTG CGGCGGCGTC AAGGCCGCGC TCGAACGTGC GCGCGTCGGC CTCGTCGATC TTTGGCTGCG CCACGTGCAG GACGTCCATG TGCGCCATCT GAAGGCCGTC GATGGCCTGG CGCCCGAACT GCGCCACGAT CGCCTGTGCG AGCTCAACGT CATCGAGCAG GTGGCCAACG TGGCGCAGAC CGTGGTCGTG CAGGACGCCT GGCGGCGCGG CCAGCGCCTC ACCGTGCATG GCTGGATCTA CGGCCTGCAG GACGGTCTGA TCCGCGACCT CGGGATGAAC CTCAGCCGTT CCGACGATCT CATGCCGCGC TACGTCGCGG CGCTGGAGGC GCTCGGCAAC TGA
|
Protein sequence | MPTRLCALGA NDTVELEHVR GFFRQYAAWL GVDLSFQGFA DEIANLPGAY GAADGRLFYA EVDGKPAGCV GIRRFSEGVC EMKRLYVDPA FRGGGVGRKL ALAAIKAARL FGYRRILLDT IPSMRIAVKL YRELGFKEVP ACYPSPIEGA IFLTLDLENW SEDDVSNENL FHLFDYNQAW SRQMQQLDPG FFGKLAQLQA PEYLWIGCSD SRVPANQIVG LLPGEVFVHR NIANVIVHTD LNALAVIQYA VDVLQVKHIM VVGHYGCGGV KAALERARVG LVDLWLRHVQ DVHVRHLKAV DGLAPELRHD RLCELNVIEQ VANVAQTVVV QDAWRRGQRL TVHGWIYGLQ DGLIRDLGMN LSRSDDLMPR YVAALEALGN
|
| |