Gene Tmz1t_0762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0762 
Symbol 
ID7084153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp846613 
End bp848043 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content67% 
IMG OID643697787 
ProductRadical SAM domain protein 
Protein accessionYP_002354429 
Protein GI217969195 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.502838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGCCG TCTTGAATCT GGTGCAGGAC AACCTGCACG AGCTGCGCAT CGATGAGACG 
CGCATGCTGT TCCACATCCC GAGCAGTTCG CTGTTCGCGC TCGACCCGCT GAGCGCGGCG
CTGATCGATC GCATCCGCCG TCAGAGCCTC ACCGTCGAGG CCCTGATCGA CGGCCTGCGC
ACCGAGTTCG ACGCCGACGA GGTGGCCGAG GCGATCCGCG AGCTGGTCGC GCTCGAGCTC
ATCGACGACG GCCGCCCCGC CGGCGCCGGC ACGCTCGCGC ACACCTTCGA GCGCTTCCCG
CTCACCACGG TGGTGCTCAA CGTCAACACC GGCTGCAACC TGAGCTGTAC CTACTGCTAC
AAGGAAGACC TCGACAAGCC CTCGGCGGGC CGCAAGATGG CCTTCGGGAC CGCGCGCGAC
GCGATCGAGA TGATGTTGCG CGAGTCGCCG GACGAGCCGC GCTACAACGT CGTCTTCTTC
GGCGGCGAGC CGCTCAGCAA CCTGCCGCTG ATCAAGGACG TGGTCGCGTA CTGCGAGGCG
CGCTTCGCCG AGCTCGGCAA GCAGGTCGAT TTCGTCATGA CGACGAACGC GACCCTGCTC
GCCGACGACA CCATCGACTG GCTCGATGCC CACCGCTTCG GGCTGTCGAT CAGCATCGAC
GGTCCGAAGG CGATCCACGA CCGCAACCGG CTCACCGTCG GCGGCCAGGG CACCTACGAG
ACCGTGCGGC GCAAGGCCGA GCGCCTGCTG GCGCGCTACC ACGCGCGGCC GGTGGGGGCA
CGGGTCACGC TCACCCACGG CACCACCGAG GTCGAGCGCA TCTGGGACCA CCTGTTCAAC
GAGCTGGGCT TTGCCGAAGT GGGCTTCGCG CCGGTGACCT CGGGCGACAT CAGCACCTTC
AACCTCACGG GCGCGGAGCT GGTCGAGGTC TTCGCCGGGC TGAAGCGGCT CGGTGCGCGC
TATCTGGAGG CGGCGCTGGA GGGACGCAAC ATCGGTTTCT CCAACATGCA CCAGCTGATC
ACCGACCTGC ACGAAGGCCA CAAGAAGGCG CTGCCCTGTG GCGCCGGGTT GAAGATGCTC
GCGGTCGACC ACAAGGGTGA ACTGAACCTG TGCCATCGCT TCACCGGCTC CACGCTGCCG
ACCTTCGGCG ACGTGAAGAA CGGTATCCAG CGCGCGCAGC TCGGCGATTT CCTGTCCCAG
CGCCTGGATC GCACGGATAC CGGCTGCGCG AGCTGCCGCA TCCGCAACCT GTGCTCGGGC
GGCTGCTACC ACGAGAGCTA CGCGCGCTAC GGCGATCCCG CACATCCCAC CTACCACTAC
TGCGATCTGA TGCGCGACTG GGTGGACTTC GGCATCGAGG TCTACAGCCG CATCATGGCC
GGGAACCCGG CCTTCATCGA ACAGCATATT TCCCCGAGGA GGGCGTCATG A
 
Protein sequence
MGAVLNLVQD NLHELRIDET RMLFHIPSSS LFALDPLSAA LIDRIRRQSL TVEALIDGLR 
TEFDADEVAE AIRELVALEL IDDGRPAGAG TLAHTFERFP LTTVVLNVNT GCNLSCTYCY
KEDLDKPSAG RKMAFGTARD AIEMMLRESP DEPRYNVVFF GGEPLSNLPL IKDVVAYCEA
RFAELGKQVD FVMTTNATLL ADDTIDWLDA HRFGLSISID GPKAIHDRNR LTVGGQGTYE
TVRRKAERLL ARYHARPVGA RVTLTHGTTE VERIWDHLFN ELGFAEVGFA PVTSGDISTF
NLTGAELVEV FAGLKRLGAR YLEAALEGRN IGFSNMHQLI TDLHEGHKKA LPCGAGLKML
AVDHKGELNL CHRFTGSTLP TFGDVKNGIQ RAQLGDFLSQ RLDRTDTGCA SCRIRNLCSG
GCYHESYARY GDPAHPTYHY CDLMRDWVDF GIEVYSRIMA GNPAFIEQHI SPRRAS