Gene Tmz1t_1156 details

Gene Information

Locus tag: Tmz1t_1156
Symbol:
ID: 7084685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1279579
End bp: 1280520
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 72%
IMG OID: 643698171
Product: Mammalian cell entry related domain protein
Protein accession: YP_002354811
Protein GI: 217969577
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACCC GCGCCCATCA TGTGCTGATC GGCCTCTTCA CCGTGCTCGT CGTCGGCGCC 
GCGCTGATGT TCGCGCTGTG GCTGGGCAAG AGCGACGCCG ACCGCCAGTT CGAGGTGTAC
GACATCGTCT TCCAGGAGGC GGTCTCCGGC CTGTCCAAGG GTGGCACGGT GGAGTTCAAC
GGCATCAAGA TCGGCGACGT GGTCAGCCTG CGCCTCGATC CCGCCGATGC GCGCCGCGTC
ATCGCCCGCG TGCGCGTGGA CAGCGCCGCG CCGGTGCGCA GCGACACCCG CGCCCGCCTG
GTGCCTGCCG GCATCACCGG CTTGACCATG ATCCGCCTGA CCAGCGGCGA GGATCCGGCG
AGCACGCCCC TGGTGTCCAA GGGCGACGAG GTGGCGCGCA TCATCGCCGC GCCTTCGCCG
CTGAGCCGCC TGCTCGCCGA CGGCGAGGAC GCGATCACCA ACGTGAACGA CCTGCTGGTG
CAGGCGCGCG AGCTGCTCTC GGCCGACAAC GTGGCCTCGT TCGGGCGCAC GCTGGGCAAC
CTGGAGCTGG CCACCGGCGC GCTCGCCGCG CAGCGCGAGG ACCTCAACGC CGCGTTGCGC
GAGGTCACCC AGGCCAGCCG CGACGCCAGC ACTGCGCTCG CCGAGGCTGC CCGCATGCTC
GGCTCGGCCA ACAGGCTGGT GGAGGTGCAG GGCACGCAGA CCCTGGACAG CGCACGCGAC
GCGATGCGGG CCTTCGAGCG CGCGATGGGC ACGGTCGACC GCCTGATCGC CGACAACCGC
GCGCCGCTCG ACGGCGGCAT GCGCGGCCTG GCCGAGATCG GCCCCGCGGT GGCGGAGCTG
CGCACCACGC TGGCTTCGCT GCGCATCATC ACCCGCCAGC TCGAGAGCCG TCCCGCCGAC
TACCTGCTCG GCCTCGAACC GACCAAGGAG TTCACCCCGT GA
 
Protein sequence
METRAHHVLI GLFTVLVVGA ALMFALWLGK SDADRQFEVY DIVFQEAVSG LSKGGTVEFN 
GIKIGDVVSL RLDPADARRV IARVRVDSAA PVRSDTRARL VPAGITGLTM IRLTSGEDPA
STPLVSKGDE VARIIAAPSP LSRLLADGED AITNVNDLLV QARELLSADN VASFGRTLGN
LELATGALAA QREDLNAALR EVTQASRDAS TALAEAARML GSANRLVEVQ GTQTLDSARD
AMRAFERAMG TVDRLIADNR APLDGGMRGL AEIGPAVAEL RTTLASLRII TRQLESRPAD
YLLGLEPTKE FTP
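
The figures in this record can be cross-checked against the sequences themselves. The following is a minimal Python sketch, not part of the original record: the helper names (clean, gc_percent, translate) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 is assumed to share for internal codons (the two tables differ in which start codons are permitted). It strips the 10-base block spacing, computes GC content, and translates up to the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to match the internal
# codon assignments of translation table 11 (illustrative, not from the record).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGGAAACCC GCGCCCATCA TGTGCTGATC"
print(translate(first_blocks))  # METRAHHVLI

# 942 bp is 314 codons; the last one is the stop codon, leaving 313 residues.
print(942 // 3 - 1)  # 313
```

Passing the full gene sequence to gc_percent and translate should allow the listed GC content and protein sequence to be compared with the record.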