Gene Tmz1t_0266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0266 
Symbol 
ID7084388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp300959 
End bp302293 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content64% 
IMG OID643697307 
Productprotein of unknown function DUF21 
Protein accessionYP_002353955 
Protein GI217968721 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATCG CCTTACTCAT CGCCCTCATC GTGCTCAACG GTGTCTTTGC AATGTCCGAG 
ATCGCGCTCG TCACCGCGCG CCGCGCGCGC CTCGCACGGC TTGCGGACGA CGGTGACGGC
TCGGCCGCAG TCGCCATGAA GCTCGGCGAG GATCCCACGC GCTTCCTGTC CACGATCCAG
ATCGGCATCA CCTCGATCGG CATCCTCAAC GGCATCGTCG GCGAGGCGGC CCTCGCGGGC
CCGCTGGCGG AGTGGTTGCA GACGCTGGGC ATGGAGCAGC GCACCAGCGA GATCGGATCG
ACCGTGCTGG TCGTCGTCGT CATCACTTAC GTCTCGATCG TGGTCGGCGA GCTCGTCCCC
AAGCGCATCG GCCAGATCAA TCCGGAGGGC ATCGCCCGCC TCGTAGCCCG GCCCATGAAT
GTGCTGGCCA TGGCCTCGCG CCCCTTCGTC TATCTGCTGG CCGGTTCGAC CGCGCTGCTG
CTACGTCTGA TGGGACAACG CGAGACGACT GGCCCCAGCG TGACTGAGGA AGAAATCCAC
GCGATGCTCA ATGAGGGCTC GGAAGCCGGC GTCATCGAGA AGAGCGAACA TGAGATGGTG
CGCAACGTGT TCCGCCTCGA CGATCGTCAG ATCGGCTCGC TGATGGTGCC GCGTGCCGAC
ATCGTCACCC TGGACGTGGA TCGCCCCCTC GATGAGAACC TCGCGCTGGT GGCCGAATCC
GCGCACTCGA GTTTCCCGGT GTGCCGGGAT GGGCTGGATG AGATCCTCGG CATCGTCAGC
GCCAAGCAGA TCTTCTCCCA GATGGTGCGT GGCGAGTCGG TCGACTTCAC ACAAAACCTG
CAGGCGCCGG TCTACGTGCC CGAATCGCTC ACCGGCATGG AACTGCTCGA TCAGTTCCGG
GCCTCCGGCA CGTACATCGT CTTCGTGATC GACGAGTACG GCGAGGTGCA AGGCATGGTC
ACGCTGCACG ACGTCATCGA ATCCGTGACC GGCGAGTTCC TCCCGCACGA CACGAAGGAA
TCGTGGGCCG TGCAGCGCGA GGACGGCTCC TGGCTGCTCG ATGGACTCAT CCCGATCGTC
GAGTTCAAGG ATCGCCTGGG CATCAAGGCC GTGCCCGAAG AAGAAAAGGG GCGATACCAC
ACGCTGTCGG GCATGGTGAT GTGGCTGCTC GGCCGCCTGC CCAACACCGG CGACATCGCC
ACCTGGGAGA ACTGGCGTTT CGAGGTCATC GACCTCGACG GCAAGCGCAT CGACAAGGTA
CTGGCGATGC AACGGCCGGA ACCGGCCCCT GAGACGATCG TCGAAAGCGA GTCTCAGGCG
CCTTCGCAAG CCTGA
 
Protein sequence
MEIALLIALI VLNGVFAMSE IALVTARRAR LARLADDGDG SAAVAMKLGE DPTRFLSTIQ 
IGITSIGILN GIVGEAALAG PLAEWLQTLG MEQRTSEIGS TVLVVVVITY VSIVVGELVP
KRIGQINPEG IARLVARPMN VLAMASRPFV YLLAGSTALL LRLMGQRETT GPSVTEEEIH
AMLNEGSEAG VIEKSEHEMV RNVFRLDDRQ IGSLMVPRAD IVTLDVDRPL DENLALVAES
AHSSFPVCRD GLDEILGIVS AKQIFSQMVR GESVDFTQNL QAPVYVPESL TGMELLDQFR
ASGTYIVFVI DEYGEVQGMV TLHDVIESVT GEFLPHDTKE SWAVQREDGS WLLDGLIPIV
EFKDRLGIKA VPEEEKGRYH TLSGMVMWLL GRLPNTGDIA TWENWRFEVI DLDGKRIDKV
LAMQRPEPAP ETIVESESQA PSQA