Gene Tmz1t_0731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0731 
Symbol 
ID7083960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp816024 
End bp817166 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content68% 
IMG OID643697757 
ProductCarbonate dehydratase 
Protein accessionYP_002354399 
Protein GI217969165 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0288] Carbonic anhydrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACCC GTCTTTGCGC CCTCGGCGCC AACGACACCG TCGAGCTCGA GCACGTGCGC 
GGCTTCTTCC GCCAGTACGC CGCCTGGCTC GGCGTGGACC TGAGCTTCCA GGGGTTCGCC
GACGAGATCG CCAACCTGCC GGGGGCCTAC GGCGCCGCCG ACGGCCGCCT GTTCTACGCC
GAGGTCGATG GCAAGCCGGC CGGCTGCGTG GGCATCCGCC GCTTCTCCGA GGGCGTGTGC
GAGATGAAGC GGCTCTACGT CGATCCGGCC TTCCGTGGTG GCGGCGTGGG GCGCAAGCTC
GCGCTCGCCG CCATCAAGGC GGCGCGTCTG TTCGGCTACC GCCGCATCCT GCTCGACACC
ATCCCGTCGA TGCGGATCGC GGTCAAGCTC TACCGCGAGC TCGGCTTCAA GGAAGTGCCC
GCCTGCTACC CCTCGCCGAT CGAGGGTGCG ATCTTCCTGA CCCTGGACCT GGAGAACTGG
TCGGAGGACG ACGTCAGCAA CGAGAACCTC TTCCACCTCT TCGACTACAA CCAGGCGTGG
TCGCGCCAGA TGCAGCAGCT CGACCCCGGC TTCTTCGGCA AGCTGGCGCA GCTTCAGGCC
CCGGAGTACC TCTGGATCGG CTGTTCCGAC TCGCGCGTGC CGGCCAACCA GATCGTCGGC
CTGCTGCCGG GCGAGGTCTT CGTCCACCGC AACATCGCCA ACGTGATCGT GCACACCGAC
CTCAACGCGC TGGCGGTGAT CCAGTACGCG GTCGATGTAC TGCAGGTGAA GCACATCATG
GTCGTTGGCC ACTACGGCTG CGGCGGCGTC AAGGCCGCGC TCGAACGTGC GCGCGTCGGC
CTCGTCGATC TTTGGCTGCG CCACGTGCAG GACGTCCATG TGCGCCATCT GAAGGCCGTC
GATGGCCTGG CGCCCGAACT GCGCCACGAT CGCCTGTGCG AGCTCAACGT CATCGAGCAG
GTGGCCAACG TGGCGCAGAC CGTGGTCGTG CAGGACGCCT GGCGGCGCGG CCAGCGCCTC
ACCGTGCATG GCTGGATCTA CGGCCTGCAG GACGGTCTGA TCCGCGACCT CGGGATGAAC
CTCAGCCGTT CCGACGATCT CATGCCGCGC TACGTCGCGG CGCTGGAGGC GCTCGGCAAC
TGA
 
Protein sequence
MPTRLCALGA NDTVELEHVR GFFRQYAAWL GVDLSFQGFA DEIANLPGAY GAADGRLFYA 
EVDGKPAGCV GIRRFSEGVC EMKRLYVDPA FRGGGVGRKL ALAAIKAARL FGYRRILLDT
IPSMRIAVKL YRELGFKEVP ACYPSPIEGA IFLTLDLENW SEDDVSNENL FHLFDYNQAW
SRQMQQLDPG FFGKLAQLQA PEYLWIGCSD SRVPANQIVG LLPGEVFVHR NIANVIVHTD
LNALAVIQYA VDVLQVKHIM VVGHYGCGGV KAALERARVG LVDLWLRHVQ DVHVRHLKAV
DGLAPELRHD RLCELNVIEQ VANVAQTVVV QDAWRRGQRL TVHGWIYGLQ DGLIRDLGMN
LSRSDDLMPR YVAALEALGN