Gene Tmz1t_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3036 
Symbol 
ID7874506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3285986 
End bp3287947 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content72% 
IMG OID643699959 
Productbifunctional 3-phosphoshikimate 1-carboxyvinyltransferase/cytidine monophosphate kinase 
Protein accessionYP_002890011 
Protein GI237653697 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR00017] cytidylate kinase
[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00127659 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTTC TCGATCTGCC CCCGATGCTG GGCGCCGCCG GCAGCGTCCG CCTGCCCGGC 
TCGAAGAGCA TCTCCAACCG CGTGCTGCTG CTGGCCGCGC TCGCCGAGGG CGAGACCGAC
ATTCGCGACC TGCTGCTGTC GGACGACGTC GAACGCATGC TCGAGGCCCT GCGCGCACTC
GGCGTGGACT GGCGGCGCGA GGGCGACAGC CTGAACTACC GTGTATGCGG CGTCGGCGGC
CCCTTCCCGG TCAAGACCGG CGATCTCTTC CTCGGCAACG CCGGCACCGC CTTCCGGCCG
CTCACCGCGG CGCTCGCGCT GTCGGGCGGC GAGTACCGCC TGTCGGGCGT ACCGCGCATG
CACGAGCGCC CGATCGGCGA CCTCGTCGAC GCGCTGCGCC AGCTCGGCGC CGACATCACC
TGCACCGCCA ACGAGGGCTA CCCTCCGCTG CATCTCAAGC CCGCCACGAT CCGTCCCGGT
GGCGTGGTGC GCGTGCGTGG CGATGTCTCC AGCCAGTTCC TCACCGCGCT GCTGATGGCG
CTGCCGCTGA CCGGAGTGGA GACCACGATC GAGGTGGTGG GCGAGCTCAT CTCCAAGCCC
TACATCCGCA TCACCCTCGA GCTGATGGCG CGCTTCGGCG TGCAGGTCGG GCAGCAGGGC
TGGGAGCGCT TCGTGGTGCC GGGCGGCGCG CGCTACCGCA GCCCGGGCAC GGTGTTCGTC
GAGGGCGACG CCTCTTCGGC ATCCTATTTC CTCGCCGCCG GGGCGATCGG TGGTGGGCCG
GTGCGGGTGG AAGGCGTGGG GCGGACCAGC ATCCAGGGCG ACGTGCGCTT CGCCGAGGCG
CTGGAGCAAC TCGGCGCGCG CATCACCCTG GGCGACAACT GGATCGAGGC CGCGGCGCCC
GCCGGCGGCG TGCTGAAGGC CTTCGACCTC GACCTCAACC ACATCCCCGA CGCGGCGATG
ACGCTGGCGG TGGCGGCGCT GTTCGCCGAC GGGCCGTGCC GGCTGCGCAA CATCGCGAGC
TGGCGGGTCA AGGAGACCGA CCGCATCGCC GCGATGGCGA CCGAGCTGCG CAAGCTCGGC
GCCGAGGTGG AGGAGGGCGC CGACTACCTG GTGGTGCAGC GCCCGCCGCG CCTGCAGCCG
GCGGCGATCG ACACCTACGA CGACCACCGC ATGGCGATGT GCTTCTCGCT GGCGAGCCTG
GGGGGCTGCC GCGTGCGCAT CAACGACCCG AAGTGCGTGA ACAAGACCTT CCCCGGCTAT
TTCGAGGCCT TTGCGCAGGT GGCGCGGCCG GTGCCGGTGC TGGCCATCGA TGGTCCCTCG
GCCTCGGGCA AGGGCACGGT GGCGGCGCGC GTCGCCGAGA CGCTGGGCTG GCACTACCTC
GACAGCGGCT CGCTCTACCG CCTGGTGGCG CTGGCCGCGA TGCGCGCGGG AGTGGCCTTC
GACGACGAGG CCGGCGTGGC CGCGCTCGCC GCGGGTCTGC CGGCGCGCTT CGAGGGCGGG
CGGGTGCTGT TGGGAGAGGG CGCGGACCGC GACGTCACCG ACGAGATCCG CTCCGAGGCC
TGCTCGGTCG GCGCCTCGAA GGTCGCCGTG CTGCCGGCGG TGCGTGCGGC CCTGCTCGAC
CGCCAGCGCG ACTACCGCGC GGCCCCCGGC CTGGTGGCCG AAGGCCGCGA CATGGGCTCG
GTGATCTTCC CCGATGCAGG GCTCAAGGTC TTCCTCACCG CCTCGGCCGA GGCGCGCGCC
GAACGCCGCC ATAAGCAGTT GATCGAAAAG GGTTTGGCTG CTAACATGGA AAATCTTCTG
AAAGACCTTC AGGAACGGGA TGCGCGCGAT GCCGCCCGGC CTGTGGCGCC ACTCGTGAAG
CTGCCGGATG CGGCCCTGCT CGACACCACT CAACGGGACG TCGACGAGGC CGTCGCCTTC
GTTCTCGATC TCGTCGGGCG GGGGGGCAAG TCGTCCGGTT GA
 
Protein sequence
MEFLDLPPML GAAGSVRLPG SKSISNRVLL LAALAEGETD IRDLLLSDDV ERMLEALRAL 
GVDWRREGDS LNYRVCGVGG PFPVKTGDLF LGNAGTAFRP LTAALALSGG EYRLSGVPRM
HERPIGDLVD ALRQLGADIT CTANEGYPPL HLKPATIRPG GVVRVRGDVS SQFLTALLMA
LPLTGVETTI EVVGELISKP YIRITLELMA RFGVQVGQQG WERFVVPGGA RYRSPGTVFV
EGDASSASYF LAAGAIGGGP VRVEGVGRTS IQGDVRFAEA LEQLGARITL GDNWIEAAAP
AGGVLKAFDL DLNHIPDAAM TLAVAALFAD GPCRLRNIAS WRVKETDRIA AMATELRKLG
AEVEEGADYL VVQRPPRLQP AAIDTYDDHR MAMCFSLASL GGCRVRINDP KCVNKTFPGY
FEAFAQVARP VPVLAIDGPS ASGKGTVAAR VAETLGWHYL DSGSLYRLVA LAAMRAGVAF
DDEAGVAALA AGLPARFEGG RVLLGEGADR DVTDEIRSEA CSVGASKVAV LPAVRAALLD
RQRDYRAAPG LVAEGRDMGS VIFPDAGLKV FLTASAEARA ERRHKQLIEK GLAANMENLL
KDLQERDARD AARPVAPLVK LPDAALLDTT QRDVDEAVAF VLDLVGRGGK SSG