Gene Tmz1t_3074 details

Gene Information

Locus tag: Tmz1t_3074
Symbol:
ID: 7874544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3328327
End bp: 3329457
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 71%
IMG OID: 643699997
Product: chorismate synthase
Protein accession: YP_002890049
Protein GI: 237653735
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
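
The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent, which a short sketch can check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3328327, 3329457)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```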


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGCA ACACCCTCGG CACGCTCTTC ACCGTCACCT CCTTCGGGGA ATCCCACGGC 
CCGGCGATCG GCTGCATCGT CGATGGCTGC CCGCCGGGAC TGGCGATCTG CGAGGCAGAC
ATCCAGGCCG AGCTCGACCG CCGCAAGCCG GGCACCTCGC GCCACGTCAC CCAGCGCCGC
GAGCCCGACA CCGTCGAGAT CCTCTCCGGC GTGTTCGAGG GCGTCACCAC CGGCACCCCG
ATCTCGCTGC TGATCCGCAA CCAGGACCAG CGCAGCAAGG ACTACGGCAA CATCGCCGAC
ACCTTCCGCC CCGGCCACGC CGACTACGCC TACCTGCAGA AGTACGGCCT GCGCGACCAT
CGTGGCGGCG GGCGCTCGTC GGCGCGCGAG ACCGCGGTGC GGGTGGCGGC CGGCGCGATC
GCGAAGAAGT GGCTGAAGGA GCGCCACGGC ATCGTCATCC GCGCCTGCAT GGGTGCGCTC
GGCCCGATCG AGATTCCCTT CGTGTCCTGG GACGAGGTCG ACGGCAACCC CTTCTTCGCG
CCCAACGCCG CGATCGTGCC CGAGCTCGAG GCCTTCATGG ACGCGCTGCG CAAGTCGGGC
GACTCGATCG GCGCGCGCAT CGACGTGGTC GCCAGCGGCG TCCCGGTCGG CTGGGGCGAG
CCGGTGTATG GCCGCCTGGA CGCCGACATC GCCTATGCGA TGATGGGCAT CAACGCGGTC
AAGGGCGTGG AGATCGGCGC CGGGTTCAAG TCGGTCGCGC AGCGCGGCAC CGAGCACGGC
GACGAGATGA CGCCCGCGGG CTTCCTGTCC AATCATGCGG GCGGCGTGCT CGGTGGCATC
TCCACCGGGC AGGACATTCT CGCCAGCATC GCGATCAAGC CGACCTCGAG CATCCGCCTC
GAGCGTCGCT CGATCGACCG CGCAGGCAAT CCCGTCATGG TCGCCACCGA GGGTCGCCAC
GACCCCTGCG TGGGCATCCG CGCGACGCCG ATCGCCGAAT CCATGCTCGC ACTGGTGCTG
ATCGACCACG CGCTGCGCCA TCGCGCGCAG TGCGGCGACG TGCGCACCGA CACCCCGCGC
ATCCCGGCGC TCGCGCCCGC AGGGAGCCAG CGCCTGCCTT CGCCGCGCTG A
 
Protein sequence
MSGNTLGTLF TVTSFGESHG PAIGCIVDGC PPGLAICEAD IQAELDRRKP GTSRHVTQRR 
EPDTVEILSG VFEGVTTGTP ISLLIRNQDQ RSKDYGNIAD TFRPGHADYA YLQKYGLRDH
RGGGRSSARE TAVRVAAGAI AKKWLKERHG IVIRACMGAL GPIEIPFVSW DEVDGNPFFA
PNAAIVPELE AFMDALRKSG DSIGARIDVV ASGVPVGWGE PVYGRLDADI AYAMMGINAV
KGVEIGAGFK SVAQRGTEHG DEMTPAGFLS NHAGGVLGGI STGQDILASI AIKPTSSIRL
ERRSIDRAGN PVMVATEGRH DPCVGIRATP IAESMLALVL IDHALRHRAQ CGDVRTDTPR
IPALAPAGSQ RLPSPR
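
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-content calculation. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks over several lines."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used here as a short sample.
first_line = "ATGTCCGGCA ACACCCTCGG CACGCTCTTC ACCGTCACCT CCTTCGGGGA ATCCCACGGC"
dna = clean_sequence(first_line)
print(len(dna))                   # 60
print(round(gc_percent(dna), 1))  # GC percentage of this sample line only
```

Passing the full gene sequence text to `clean_sequence` in the same way yields a string whose length can be compared with the Gene Length field and whose `gc_percent` can be compared with the GC content field.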