Gene Tmz1t_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0052 
Symbol 
ID7083435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp57563 
End bp58888 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content72% 
IMG OID643697100 
Productconserved hypothetical cytosolic protein 
Protein accessionYP_002353749 
Protein GI217968515 
COG category[S] Function unknown 
COG ID[COG4924] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTGGA GCACCGCGCA GGACCTGAGG GCGCAGGTGA TGCGCCTGTG GGAGCGCGGC 
GAGCTGCTGC GCGAGGGGCT GCCGCAGGAC AGCGGCGCCC CCCCGTGCGC GTGCTTGCCG
GAGGAGGAGA GCGAGGGCTC TGGCGCTCAG CCTTGCGAGC TACCGGCCGA TGCCGCGGCG
TACACGGCCG TGGGACCACC TGCAGGCCAC AGCCGCTTCC CGCTGCGACT CACCCTGAAG
ACCCCAACCT CGGACGACAT CACCCGCCAC TTCGATGCCG TGCGCGCCTG GGTTGCGGCG
ATCTCGGCCA CGCCCCATGT GCGCCTCGAA TGGCAGGAGA CCCGCCACCG CGTGCAGGGC
AGCCAGCGCC TGCCCGCGAG CGCCTGGGTC GACCGCCTCG ACGACGCCCT GGCCTGGATC
GGCAAGCGCG CCGAGGACGC GCGCTTTCGT GCGCTGCACG CCGAGACTGC CGCACGCCAG
CCCCTGCTGC TGCCCTGGCT GCACAAGCGC CCGCTGCGCG CGCTCGAACT CGCCGCCGAG
TGGTCGCGCC TGCTCGACGT GGTGGCCTGG CTGCAGGCCC ACCCGCGCCC GGGCATGTAT
CTGCGCCAGG TCGACCTGCC CGGCATCCAC ACCAAGTTCA TCGAATCCCA GCGCGGCGTG
CTCGCCGAGC TGCTCGACCT CGCCCTGCCC GCGGCGGCGA TCGACCCGAG CCGCACCGGC
GCGCAGCAGT TCGCCGCCCG CTACGGCTTC CTGGACAAGC CCGTGCTGCT GCGCTTGCGC
ATCCTCGACC CGGCGCTCGG CCTGCTGCCT GGCGCGCCCT GCCCCGACCT CGCCCTCGAC
GCCGACAGCT TCGCCCGCCT GCGACTCGAC GTGGCGCGCG TCTTCATCAC CGAGAACGAG
ACCAACTTCC TCGCCTTCCC CCGCGTCGAC AAGGCCATCG TGATCTTCGG CGCCGGCTAC
GGCTGGGAGG CCCTCGCGCG CGCCGAGTGG CTGCAGCGCT GCCCGATCCA CTACTGGGGC
GACATCGACA CCAACGGCTT CGCCATCCTC GCCCAGCTGC GCGCCCGCTT CGCCCATGTC
GAGTCCCTGC TGATGGACCG CGCCACCCTG CTCGCACATG AGGCGCTGTG GGGCCGGGAA
GACAGCCCGC GCCCGGCCGA CGTCTCGCGC CTCAGCGCCG AGGAACGCGG CCTGTACGAA
GACCTGCGCA ACCACCACAT CCGCCCGTCC CTGCGCCTGG AGCAGGAACA CATCGGCTTC
GGCTGGCTGG AGAAGGCGCT GAGAATCGTC CACGCGGTGG ATGATTTTCA GCCATCTGAT
GGTTGA
 
Protein sequence
MSWSTAQDLR AQVMRLWERG ELLREGLPQD SGAPPCACLP EEESEGSGAQ PCELPADAAA 
YTAVGPPAGH SRFPLRLTLK TPTSDDITRH FDAVRAWVAA ISATPHVRLE WQETRHRVQG
SQRLPASAWV DRLDDALAWI GKRAEDARFR ALHAETAARQ PLLLPWLHKR PLRALELAAE
WSRLLDVVAW LQAHPRPGMY LRQVDLPGIH TKFIESQRGV LAELLDLALP AAAIDPSRTG
AQQFAARYGF LDKPVLLRLR ILDPALGLLP GAPCPDLALD ADSFARLRLD VARVFITENE
TNFLAFPRVD KAIVIFGAGY GWEALARAEW LQRCPIHYWG DIDTNGFAIL AQLRARFAHV
ESLLMDRATL LAHEALWGRE DSPRPADVSR LSAEERGLYE DLRNHHIRPS LRLEQEHIGF
GWLEKALRIV HAVDDFQPSD G