Gene Tmz1t_2504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2504 
Symbol 
ID7873943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2706507 
End bp2707520 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content71% 
IMG OID643699426 
Productaminodeoxychorismate lyase 
Protein accessionYP_002889483 
Protein GI237653169 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.917927 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGTT TTCTCCCCCG CCTGCTCCTG CTGCTCGCCG TGCTTGCCAT CCTTGCCGCG 
ATGATCGCGG CGACGGGCTG GTGGTATGCG CACCGGCCGC TCGCGCTCGC CGCCGAGCGG
GTGGATTTCA CCGTGGCCCG CGGCATGGGC ATGCGCCAGG CCGCCGCCGC CATCGAGCGC
GCCGGCGTGG GCGTGGATGC GCGCCTGCTC GCGCTGCTCG CGCGTCTGAC GAAGCGCGAC
GCCCGCATCA AGGCCGGCAG CTACGAGGTG CACGCCGGCA TCACGCCCTG GCAGCTCATC
CTCAAGCTCT CCGACGGCGA CGTCACGCAG GGCGAGCTGT TGCTGGTCGA GGGCTGGACC
TTCCGCCAGG TGCGCCAGGC GCTGGAGTCC CATCCGGATC TGGAAGCCGA CACCGCCGGG
CTGGGCGAGG CGGAGATCCT CGCGCGCATC GGCGCGAGCG CGCAGAACGC CGAGGGCCTC
TTCTTCCCCG ATACCTATCT GTTCGACAAG CGCTCGGGCG CGCTCGCCGT GCTGCGACGC
GCGCACGAGG CCATGCAGGC CCGCCTCGAC AAGGCCTGGG CCGAGCGCGA CCCGGCCACG
CCGCTGGCCT CGCCCTACGA GGCGCTGATC CTCGCCTCGA TCGTCGAGAA GGAGACCGGC
CGCCCCGAGG ACCGCGCCCT GGTCGCCTCG GTGTTCGCCA ACCGGCTGCG CATCGGCATG
CGTCTGCAGA CCGACCCCAC GGTGATCTAC GGCCTCGGCC CCGAGTTCGA CGGCCGCCTG
CGCCGGGCGC ATCTCGATGC CGACCACCCG TGGAACACCT ACACCCGTGC CGGCCTGCCG
CCGACGCCGA TCGCGATGCC GGGCGAGGCC GCGCTGCGCG CTGCGCTCAA ACCCGAGAAG
AGCGACTTCC TTTATTTCGT CGCGCGCGGC GACGGCAGCA GCGAGTTTTC GCGCGACCTC
GCGGCGCACA ATCGCGCCGT CGATAAATAC ATCCGCAACG GAGGTGGGGG ATGA
 
Protein sequence
MKRFLPRLLL LLAVLAILAA MIAATGWWYA HRPLALAAER VDFTVARGMG MRQAAAAIER 
AGVGVDARLL ALLARLTKRD ARIKAGSYEV HAGITPWQLI LKLSDGDVTQ GELLLVEGWT
FRQVRQALES HPDLEADTAG LGEAEILARI GASAQNAEGL FFPDTYLFDK RSGALAVLRR
AHEAMQARLD KAWAERDPAT PLASPYEALI LASIVEKETG RPEDRALVAS VFANRLRIGM
RLQTDPTVIY GLGPEFDGRL RRAHLDADHP WNTYTRAGLP PTPIAMPGEA ALRAALKPEK
SDFLYFVARG DGSSEFSRDL AAHNRAVDKY IRNGGGG