Gene Tmz1t_3462 details

Gene Information

Locus tag: Tmz1t_3462
Symbol:
ID: 7872968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3786951
End bp: 3788795
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 75%
IMG OID: 643700402
Product: carbonic anhydrase
Protein accession: YP_002890433
Protein GI: 237654119
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3338] Carbonic anhydrase
TIGRFAM ID:
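
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3786951, 3788795)
print(length)                  # 1845
print(protein_length(length))  # 614
```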


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGACGC CGATGAAAAA ACCGCGCCCT CTCCTCCTCG CCCTGCTGCT CGCCCACCTC 
GCCGGCGTGG CCTGTGCCGC CGACTGGCAG CTCGTGCTGA GCGACCGCGA CCGCCGCATC
GAGATCGATC GCGGCAGCAT CCTGCGCTCG GACGCCGGCA CCAAGATCTC CTGGGGCCGG
GTGGTGCTGA CCTCGGAGGA GGCGCAGCGC TCCGGCTACG CCATGATCAA GGCGCTCAAC
CGCTACGACT GCCGCAACCG CAGCTTCGCC ACGGTCAAGC GGGTGTATGT CGATGCCGAC
GACAACATCG TGCGCGAGGA GCCGGTGGCG GCCGAGGCGC CGATCGCGGT GGCGCGCAAC
ACGGTGGACG AGCGCATGTG GCGCGAGGTG TGCGATCCGC CCTCGGCGGT CGAGCTGCAG
AAGATGGCGG CCGAGGCGGC CCGGATCGCG AGCGCGCAGC CGGCGGCGGT GGCGGCCCCT
GGGGGCGCCG CAGCGCCCGC GGCTGCGGCG GTTTCGACGC CCCCTGCCGC GGCCGCCGCC
ACGGCCGTGC AGTCGCCTGC GGCGAAGCCC GCGGAGTCTG CGCGCGCGCA GTCCGGCAGC
GTGCGCCAGC CGAGCGCGGA GTCCGTCTCC GCCGAGGTGG TGCGTACCGG TGCCGGCGGT
GAGCAGCCCG CCGCGGCGCC CCCCGCCTTC GAGACCCACG TCGAGATCCC GGCCAAGTTC
CTGCCTGGCG GTCAGGGCGC CGCGGCGCCC GAGTCCTCTC GCAAGTCCAT TCTGCCGCCG
CTGCCCAAGT TGTCGGCGAC GGCCGAGCCC GAGGCGAAGA CCGCGCCGGC CCCCAAGCCG
CCGGTGCGCG AACAGCCGCC TGCGCAGGGG CGCGCTGCGC CCGCTCCGCA GGCGGCTGTG
CCGGCCCGCC AGGCGGCGCC AGCGCCGGAG CGGCCGGTCG CGCCGGAGCG CGCGGGCGCA
CGCGAGAAGG CGTCCCCGGC CAGGGCCGCG TCCGCCGTGC CGGCAAGACC GACGCCGCCG
CGCCCGCGTC CCGCCGGTCC GGAACGCGTC GCCGCAGCGC AGCCGGCATC GCCGGACGCG
ACTCCCGCGC CGCCCGAGCG CGCCGCGTCG GCCCGCGAGG AGGCCTTGCG CGCGGCCGGC
ATCGAGCTCA CCCGCCTCGG CCCGGGGGTG CCGGAGTGGA GCTACGAGGG CGAGCGCGGT
CCGCAGCACT GGGGCCGCAT GCGCCCCGAG TGGCGGCTCT GCGAGGAGGG CACGCGGCAG
TCTCCGATCG ACCTGCGCGA CGGCATCGCG GTCGATCTCG CGCCGGTGCG CTTCGACTAC
CGTCGCACCG GCTTCCGCAT CCGCGATACC GGCAACACCT TGCAGGTCGA GGTTGGCGAG
GGCATGGGCA TCACCGTGCG TGGGGTGCGT TACGCGCTCG AGCGCCTCAC CCTGCACCGC
CCCTCGCAGG ATCGCGTCGG CGGCATGGCG CACGACATGG CGATCTACCT GCAGCATCGC
GCCGATGACG GCCGCATGGC GATCGTCTCG CTGCTGCTGT CGGCGGGCGG CGACGCCAGC
CCGGCGCTGC AGACCTTGTG GAACAACCTG CCGCTCGATC GCGGGCGCGA GTTCGTTCCC
GACGCCGTGC TCGACCTGCC GGCGCTGGTG CCGGCCGACC CGGGCCACTA CCTCTACACC
GGCTCGTTGC CGATGCCGCC GTGTACCGAG GACGTGCTGT GGGTGGTGAT GAAGCAGCCG
GTGACGATCT CCGCGGACCA GCTCGACGTC TTCGCCCGCC TGTACCCGCG CAACGGCCGG
CCGATCCAGC CGACCAACGG CCGGCCGCTG CTCGAGTCGC GCTGA
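
The record reports the GC content of this gene as 75%. As a minimal sketch of how such a figure could be derived from a sequence printed in space-separated blocks of ten bases, the snippet below strips the whitespace and counts G and C. The helper names are hypothetical, and the sample is only the first line of the gene sequence, so its value is not the whole-gene figure.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(grouped)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste the full block to
# compute the whole-gene value.
first_line = "ATGCCGACGC CGATGAAAAA ACCGCGCCCT CTCCTCCTCG CCCTGCTGCT CGCCCACCTC"
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.1%}")
```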
 
Protein sequence
MPTPMKKPRP LLLALLLAHL AGVACAADWQ LVLSDRDRRI EIDRGSILRS DAGTKISWGR 
VVLTSEEAQR SGYAMIKALN RYDCRNRSFA TVKRVYVDAD DNIVREEPVA AEAPIAVARN
TVDERMWREV CDPPSAVELQ KMAAEAARIA SAQPAAVAAP GGAAAPAAAA VSTPPAAAAA
TAVQSPAAKP AESARAQSGS VRQPSAESVS AEVVRTGAGG EQPAAAPPAF ETHVEIPAKF
LPGGQGAAAP ESSRKSILPP LPKLSATAEP EAKTAPAPKP PVREQPPAQG RAAPAPQAAV
PARQAAPAPE RPVAPERAGA REKASPARAA SAVPARPTPP RPRPAGPERV AAAQPASPDA
TPAPPERAAS AREEALRAAG IELTRLGPGV PEWSYEGERG PQHWGRMRPE WRLCEEGTRQ
SPIDLRDGIA VDLAPVRFDY RRTGFRIRDT GNTLQVEVGE GMGITVRGVR YALERLTLHR
PSQDRVGGMA HDMAIYLQHR ADDGRMAIVS LLLSAGGDAS PALQTLWNNL PLDRGREFVP
DAVLDLPALV PADPGHYLYT GSLPMPPCTE DVLWVVMKQP VTISADQLDV FARLYPRNGR
PIQPTNGRPL LESR
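
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a sketch rather than a reference implementation: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop."""
    seq = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence; it matches the first 20 residues above.
first_line = "ATGCCGACGC CGATGAAAAA ACCGCGCCCT CTCCTCCTCG CCCTGCTGCT CGCCCACCTC"
print(translate(first_line))  # MPTPMKKPRPLLLALLLAHL
```

The same function applied to the final line of the gene sequence, which is in frame because every preceding line holds 60 bases, should reproduce the last residues of the protein and stop at the terminal TGA.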