Gene Tmz1t_3071 details

Gene Information

Locus tag: Tmz1t_3071
Symbol:
ID: 7874541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3324764
End bp: 3326170
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 70%
IMG OID: 643699994
Product: isopropylmalate isomerase large subunit
Protein accession: YP_002890046
Protein GI: 237653732
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
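
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (an assumption that happens to reproduce the listed 1407 bp) and that the final codon is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3324764
END_BP = 3326170


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, "bp;", protein_length(n), "aa")
```

With the values from this record, the calculation gives 1407 bp and 468 aa, matching the Gene Length and Protein Length fields.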


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCTC AAACGCTGTA TGAAAAGCTC TGGTCCAGCC ACGTGGTCCA CGAGGAGGCC 
GACGGCACGG CGCTGATCTA CATCGATCGC CACCTGGTGC ACGAAGTGAC CAGCCCGCAG
GCCTTCGAAG GCCTCAAGCT CGCCGGGCGC AAGCCCTGGC GCGTCGGCTC CATCGTCGCC
ACCGCCGACC ACAACACGCC CACCGACCAC TGGGACCTGG GCATCCAGGA CCCGGTGTCG
CGCCAGCAGG TCGAGACGCT GGACGCCAAC ATCCGCGAGG TCGGCTCGCT CGCCTATTTC
CCGTTCAAGG ACGCGCGCCA GGGCATCGTG CACGTGATCG GGCCGGAGAA CGGCGCCACG
CTGCCGGGCA TGACCGTCGT CTGCGGCGAC TCCCACACCT CCACGCACGG CGCCTTCGGC
TGCCTTGCGC ACGGCATCGG CACCTCCGAG GTCGAGCACG TGCTCGCCAC CCAATGCCTG
CTGCAGAAGA AGTCCAGGAC CCTGCTGATC CACGTCGACG GCCAGCTCGG CCGCGGCGTC
ACCGCCAAGG ACGTGGTGCT CGCCATCATC GGCAGGATCG GCACCGCCGG CGGCACCGGC
TACGCGATGG AGTTCGGCGG CAGCGCCATC CGCGCGCTGT CGATGGAAGG CCGCATGACG
ATCTGCAACA TGGCGATCGA GGCGGGCGCG CGCGCCGGGC TGGTCGGCGT GGACGAGACC
ACCATCGAAT ACCTGAAGGA TCGCCCCTTC TCGCCCAAGG GCGCGCAGTG GGAGCAGGCG
GTCGACTACT GGCGCAGCCT GCACAGCGAC GAGGGCGCCG AGTTCGACAA GATCATCGAA
CTCAAGGCCG AAGACATCCT GCCGCAGGTC ACCTGGGGCA CCTCGCCCGA GATGGTCACC
ACGGTCGACG GCCGTGTGCC CGATCCCGCC GCCGTCACCG ACCCGGTGCG CCGCGAGGGC
ATCGAGCGCG CGCTCAAGTA TATGGGCCTC GAGGCCAACA CCCCGATCAC CGACATCCCG
GTCGACCAGG TCTTCATCGG CTCGTGCACC AACTCGCGCA TCGAGGATTT GCGTGAGGCT
GCCGCGGTGG CCAAGGGCCG CAGCAAGGCC GCCAGCGTCA AGCGCGTGCT GGTGGTGCCG
GGCTCCGGCC TGGTCAAGCG CCAGGCCGAG GCGGAGGGCC TGCACGAGAT CTTCCTCGCC
GCCGGCTTCG AGTGGCGCGA GCCGGGCTGT TCGATGTGCC TGGCGATGAA CGCCGACCGC
CTCGAGCCTG GCGAGCGCTG CGCCTCGACC TCGAACCGCA ACTTCGAGGG CCGCCAGGGT
GCGGGGGGGC GCACCCACCT GGTCAGCCCG GCGATGGCCG CGGCCGCCGC GGTCACCGGC
CGCTTCACCG ACGTGCGCGC GCTCTGA
 
Protein sequence
MKAQTLYEKL WSSHVVHEEA DGTALIYIDR HLVHEVTSPQ AFEGLKLAGR KPWRVGSIVA 
TADHNTPTDH WDLGIQDPVS RQQVETLDAN IREVGSLAYF PFKDARQGIV HVIGPENGAT
LPGMTVVCGD SHTSTHGAFG CLAHGIGTSE VEHVLATQCL LQKKSRTLLI HVDGQLGRGV
TAKDVVLAII GRIGTAGGTG YAMEFGGSAI RALSMEGRMT ICNMAIEAGA RAGLVGVDET
TIEYLKDRPF SPKGAQWEQA VDYWRSLHSD EGAEFDKIIE LKAEDILPQV TWGTSPEMVT
TVDGRVPDPA AVTDPVRREG IERALKYMGL EANTPITDIP VDQVFIGSCT NSRIEDLREA
AAVAKGRSKA ASVKRVLVVP GSGLVKRQAE AEGLHEIFLA AGFEWREPGC SMCLAMNADR
LEPGERCAST SNRNFEGRQG AGGRTHLVSP AMAAAAAVTG RFTDVRAL
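
The sequences above are printed in blocks of ten characters. A minimal sketch of how one might strip that layout, compute GC content, and translate the gene sequence is given below. It is an illustration, not the method the database uses: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. The alternative start codons permitted by table 11 are not handled; this gene begins with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 of a cleaned sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = ("ATGAAAGCTC AAACGCTGTA TGAAAAGCTC "
                  "TGGTCCAGCC ACGTGGTCCA CGAGGAGGCC")
    seq = clean(first_line)
    print(translate(seq))  # MKAQTLYEKLWSSHVVHEEA
    print(round(gc_percent(seq)))
```

Applied to the first line of the gene sequence, the translation reproduces the first twenty residues of the protein sequence listed above. Pasting the full gene sequence into `clean` would let the whole protein and the GC content be checked against the record in the same way.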