Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3462 |
Symbol | |
ID | 7872968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3786951 |
End bp | 3788795 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700402 |
Product | carbonic anhydrase |
Protein accession | YP_002890433 |
Protein GI | 237654119 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3338] Carbonic anhydrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGACGC CGATGAAAAA ACCGCGCCCT CTCCTCCTCG CCCTGCTGCT CGCCCACCTC GCCGGCGTGG CCTGTGCCGC CGACTGGCAG CTCGTGCTGA GCGACCGCGA CCGCCGCATC GAGATCGATC GCGGCAGCAT CCTGCGCTCG GACGCCGGCA CCAAGATCTC CTGGGGCCGG GTGGTGCTGA CCTCGGAGGA GGCGCAGCGC TCCGGCTACG CCATGATCAA GGCGCTCAAC CGCTACGACT GCCGCAACCG CAGCTTCGCC ACGGTCAAGC GGGTGTATGT CGATGCCGAC GACAACATCG TGCGCGAGGA GCCGGTGGCG GCCGAGGCGC CGATCGCGGT GGCGCGCAAC ACGGTGGACG AGCGCATGTG GCGCGAGGTG TGCGATCCGC CCTCGGCGGT CGAGCTGCAG AAGATGGCGG CCGAGGCGGC CCGGATCGCG AGCGCGCAGC CGGCGGCGGT GGCGGCCCCT GGGGGCGCCG CAGCGCCCGC GGCTGCGGCG GTTTCGACGC CCCCTGCCGC GGCCGCCGCC ACGGCCGTGC AGTCGCCTGC GGCGAAGCCC GCGGAGTCTG CGCGCGCGCA GTCCGGCAGC GTGCGCCAGC CGAGCGCGGA GTCCGTCTCC GCCGAGGTGG TGCGTACCGG TGCCGGCGGT GAGCAGCCCG CCGCGGCGCC CCCCGCCTTC GAGACCCACG TCGAGATCCC GGCCAAGTTC CTGCCTGGCG GTCAGGGCGC CGCGGCGCCC GAGTCCTCTC GCAAGTCCAT TCTGCCGCCG CTGCCCAAGT TGTCGGCGAC GGCCGAGCCC GAGGCGAAGA CCGCGCCGGC CCCCAAGCCG CCGGTGCGCG AACAGCCGCC TGCGCAGGGG CGCGCTGCGC CCGCTCCGCA GGCGGCTGTG CCGGCCCGCC AGGCGGCGCC AGCGCCGGAG CGGCCGGTCG CGCCGGAGCG CGCGGGCGCA CGCGAGAAGG CGTCCCCGGC CAGGGCCGCG TCCGCCGTGC CGGCAAGACC GACGCCGCCG CGCCCGCGTC CCGCCGGTCC GGAACGCGTC GCCGCAGCGC AGCCGGCATC GCCGGACGCG ACTCCCGCGC CGCCCGAGCG CGCCGCGTCG GCCCGCGAGG AGGCCTTGCG CGCGGCCGGC ATCGAGCTCA CCCGCCTCGG CCCGGGGGTG CCGGAGTGGA GCTACGAGGG CGAGCGCGGT CCGCAGCACT GGGGCCGCAT GCGCCCCGAG TGGCGGCTCT GCGAGGAGGG CACGCGGCAG TCTCCGATCG ACCTGCGCGA CGGCATCGCG GTCGATCTCG CGCCGGTGCG CTTCGACTAC CGTCGCACCG GCTTCCGCAT CCGCGATACC GGCAACACCT TGCAGGTCGA GGTTGGCGAG GGCATGGGCA TCACCGTGCG TGGGGTGCGT TACGCGCTCG AGCGCCTCAC CCTGCACCGC CCCTCGCAGG ATCGCGTCGG CGGCATGGCG CACGACATGG CGATCTACCT GCAGCATCGC GCCGATGACG GCCGCATGGC GATCGTCTCG CTGCTGCTGT CGGCGGGCGG CGACGCCAGC CCGGCGCTGC AGACCTTGTG GAACAACCTG CCGCTCGATC GCGGGCGCGA GTTCGTTCCC GACGCCGTGC TCGACCTGCC GGCGCTGGTG CCGGCCGACC CGGGCCACTA CCTCTACACC GGCTCGTTGC CGATGCCGCC GTGTACCGAG GACGTGCTGT GGGTGGTGAT GAAGCAGCCG GTGACGATCT CCGCGGACCA GCTCGACGTC TTCGCCCGCC TGTACCCGCG CAACGGCCGG CCGATCCAGC CGACCAACGG CCGGCCGCTG CTCGAGTCGC GCTGA
|
Protein sequence | MPTPMKKPRP LLLALLLAHL AGVACAADWQ LVLSDRDRRI EIDRGSILRS DAGTKISWGR VVLTSEEAQR SGYAMIKALN RYDCRNRSFA TVKRVYVDAD DNIVREEPVA AEAPIAVARN TVDERMWREV CDPPSAVELQ KMAAEAARIA SAQPAAVAAP GGAAAPAAAA VSTPPAAAAA TAVQSPAAKP AESARAQSGS VRQPSAESVS AEVVRTGAGG EQPAAAPPAF ETHVEIPAKF LPGGQGAAAP ESSRKSILPP LPKLSATAEP EAKTAPAPKP PVREQPPAQG RAAPAPQAAV PARQAAPAPE RPVAPERAGA REKASPARAA SAVPARPTPP RPRPAGPERV AAAQPASPDA TPAPPERAAS AREEALRAAG IELTRLGPGV PEWSYEGERG PQHWGRMRPE WRLCEEGTRQ SPIDLRDGIA VDLAPVRFDY RRTGFRIRDT GNTLQVEVGE GMGITVRGVR YALERLTLHR PSQDRVGGMA HDMAIYLQHR ADDGRMAIVS LLLSAGGDAS PALQTLWNNL PLDRGREFVP DAVLDLPALV PADPGHYLYT GSLPMPPCTE DVLWVVMKQP VTISADQLDV FARLYPRNGR PIQPTNGRPL LESR
|
| |