Gene Tmz1t_3464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3464 
Symbol 
ID7872970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3790037 
End bp3791956 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content66% 
IMG OID643700404 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002890435 
Protein GI237654121 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCA ACGAAAAATT CATCGCCGCC AAGGCCCACG TCGACGAGGC CGCGATCGCC 
CCGCTGCCCA ACTCGCGCAA GATCTACGTC GAGGGCTCGC GCCCCGACAT CCGCGTGCCG
ATGCGCGAGA TCTCGCAGGC CGACACCCCG GCCTCCTTCG GCGCCGAGCA GAACCCGCCG
ATCTTCGTCT ATGACTGTTC GGGCCCCTAC TCCGACCCGG CCGCGAAGAT CGACATCCGC
TCCGGCCTGC CCGCGCTGCG CCAGCAGTGG ATCGAGGAGC GCGCCGACAC CGAGCTGCTG
CCCGACCTGT CGTCCGAATT CGGCCGCCAG CGCGCCGCCG ACAAGAGCCT GGACGAGCTG
CGCTTCCCCG GCCTGCACCG CAAGCCGCGC CGCGCCAAGG CCGGCGCGAA CGTGTCGCAG
ATGCACTACG CGCGCCGCGG CATCATCACG CCCGAGATGG AATACGTCGC CATCCGCGAG
AACCTCAACC GCGAGCAGTA CATCGCCTCG CTGCGCGCCA GTGGCGGCCT CAAGGGCCAG
AAGATGGCCG ACATGATGCT GCGCCAGCAC CCGGGCCAGA ACTTCGGCGC CAGCCTGCCG
GCGACGATCA CGCCCGAGTT CGTGCGCGAC GAGATCGCCC GCGGCCGCGC CATCATCCCC
AACAACATCA ACCACCCCGA GAGCGAGCCG ATGATCATCG GCCGCAACTT CCTCACCAAG
ATCAACGCCA ACATCGGCAA CTCGGCGGTC ACCTCCAGCA TCGCCGAAGA GGTCGACAAG
ATGACCTGGT CGATCCGCTG GGGCGGCGAC ACGGTGATGG ACCTGTCCAC CGGCAAGAAC
ATCCACGAGA CCCGCGAGTG GATCATCCGC AACTCGCCGG TGCCGATCGG CACGGTGCCG
ATCTACCAGG CGCTGGAAAA GGTCAATGGC AAGGCCGAGG ACCTGACCTG GGAGATCTTC
CGCGACACCC TGATCGAGCA GGCCGAGCAG GGCGTGGACT ACTTCACCAT CCACGCCGGC
GTGCTGCTGC GCTACGTGCC GCTGACCGCG AACCGCATGA CCGGCATCGT CAGCCGCGGT
GGCTCGATCA TGGCCAAGTG GTGCCTGGCG CATCACAAGG AGAGCTTCCT GTACGAGCAC
TTCGAGGACA TCTGCGAGAT CATGAAGGCC TACGACGTGG CCTTCAGCCT CGGCGACGGC
CTGCGTCCCG GCTCGATCTA CGATGCCAAT GACGAGGCGC AACTCGGCGA GCTCAAGACC
CTGGGCGAGC TCACCGAGAT CGCCTGGCGC CACGACGTGC AGGTCATGAT CGAGGGCCCG
GGCCACGTGC CGCTGCAGCT CATCAAGGAG AACATGGACC TGCAGCTCGA GTGGTGCAAG
GAAGCGCCCT TCTACACCCT GGGGCCGCTG ACCACCGACA TCGCGCCGGG CTACGACCAC
ATCACCAGCG GCATCGGCGC GGCCACCATC GGCTGGTACG GCACCGCGAT GCTGTGCTAC
GTGACGCCCA AGGAGCATCT GGGCCTGCCC AACAAGCAGG ACGTCAAGGA AGGCATCATC
ACCTACAAGC TCGCCGCGCA TGCCGCCGAC CTCGCCAAGG GCCACCCGGG CGCGCAGATC
CGCGACAACG CGCTCTCCAA GGCGCGCTTC GAGTTCCGCT GGGAAGACCA GTTCAACCTC
GGCCTCGATC CGGACAAGGC GAAGGAATTC CACGACGAGA CCCTGCCCAA GGACTCGGCC
AAGGTGGCGC ACTTCTGCTC GATGTGCGGC CCGCACTTCT GCTCGATGAA GATCACCCAG
GACGTGCGCG ACTTCGCCGC CCAGCAGGGC ATCGACGAGG CCGAGGCGCT GAAGAAGGGC
ATGGAGGTGA AGTCGATCGA GTTCGTGAAG AGCGGGGCCG AGGTCTACCG CAACGTCTGA
 
Protein sequence
MNANEKFIAA KAHVDEAAIA PLPNSRKIYV EGSRPDIRVP MREISQADTP ASFGAEQNPP 
IFVYDCSGPY SDPAAKIDIR SGLPALRQQW IEERADTELL PDLSSEFGRQ RAADKSLDEL
RFPGLHRKPR RAKAGANVSQ MHYARRGIIT PEMEYVAIRE NLNREQYIAS LRASGGLKGQ
KMADMMLRQH PGQNFGASLP ATITPEFVRD EIARGRAIIP NNINHPESEP MIIGRNFLTK
INANIGNSAV TSSIAEEVDK MTWSIRWGGD TVMDLSTGKN IHETREWIIR NSPVPIGTVP
IYQALEKVNG KAEDLTWEIF RDTLIEQAEQ GVDYFTIHAG VLLRYVPLTA NRMTGIVSRG
GSIMAKWCLA HHKESFLYEH FEDICEIMKA YDVAFSLGDG LRPGSIYDAN DEAQLGELKT
LGELTEIAWR HDVQVMIEGP GHVPLQLIKE NMDLQLEWCK EAPFYTLGPL TTDIAPGYDH
ITSGIGAATI GWYGTAMLCY VTPKEHLGLP NKQDVKEGII TYKLAAHAAD LAKGHPGAQI
RDNALSKARF EFRWEDQFNL GLDPDKAKEF HDETLPKDSA KVAHFCSMCG PHFCSMKITQ
DVRDFAAQQG IDEAEALKKG MEVKSIEFVK SGAEVYRNV