Gene Tmz1t_2402 details


Gene Information

Locus tag: Tmz1t_2402
Symbol:
ID: 7094324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011667
Strand:
Start bp: 63812
End bp: 65776
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 59%
IMG OID: 643701088
Product: protein of unknown function DUF524
Protein accession: YP_002364229
Protein GI: 217980179
COG category: [S] Function unknown
COG ID: [COG1700] Uncharacterized conserved protein
TIGRFAM ID:
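
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 63812
end_bp = 65776

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1965 654
```

Under these assumptions the computed values match the listed Gene Length (1965 bp) and Protein Length (654 aa).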


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value: 0.63975
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 95
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTGCG TTCCGCTGCA TCGCGCGAGA TTCGACCTAG GCGCGGGCTG CTGGATCGAC 
ATCGAAGCCG ACCTTGGCCA AGGCAGGCCG CTCGCCGAGG GCGTTATCCA GTCGGACTCG
CTGATCTCCG CTGAGGAAGA GACCAAGGCT GAGGAGATCT CTTGGGGCGG TATTCCGCCG
ATGTGCTGGC GTGACGCTGG AGAACTGTTC GAGAAGTTCC AGCTTCGTGA GGACACCGAC
TACTTTGTCG ACGTGGCTAC GCCCTCGTCG ATACAAGAGG CCATCCATCT GTCTCAAGAA
AATCCTACGT GGCCGTTCGA GCCTAGGCTC GCGAACGCCT TCACGCGGGA GCCAGCCAGG
CGCTGGCGGC AAGAGGGTGG CAAAACGGTG GTCACAGGGC AGATTCGCTT GCGCTCGCAC
GCCGGCGTAT TGAACCTCTC GCCCGTGTTC GGAGGAGAGG TGCGAGCCGA GGTCGCCTGT
CGAAAGCTGC GTTACTTTGA GGAGTTCAAG CGTCTGCTGG ACGAACTGGC AGAGAAGGCC
ACAGAGCTAC TGCTTTCCTA CGACAGCCCG GTTTCTCTGA ACTTTCAGAC GACTGACGAT
CTCGCAAAAA ACGAATCCGC CCTGCACTTC CTGATGCGGC ATGTGATGGC AAAGGAAAGG
CTCCCGATGT CCATCGAGGA GATCGTAGAG CGACCGCATG TGCGCTTGGT GGAACGCGTG
GAGCCCACGC CGATCGACGA AATCCAAGAA GCCGATCCAG AACTAGTCGC TGACGGGCTG
GACTACTCTG AACTCAGCCC AAGTGGTCCC TTAGCTCGCC TATTTAGAGG TTTCACGCCT
ACAGCGCTGC CTCATCGCGA GAGTTACGAA TCCCTCGATA CGCCTGAGAA CCGCTATGCC
AAAGCTTTCT TGGAGCATTG CAGCCTTGTC TCACGGCGGC TTGAAGGCGC GTTGGCCTCA
CAGGGACGGC GAGCGTCAGC GCGCGAAGCT CGCGCTTGGG GCGTGTCGCT CGACGAAGCT
TTGCAGCATG GAATGTGGCG AGATGTGGGC CCTCTCACTC AAATCCCTGC AAACTCGCAG
ACGCTCTTGC GAAAGCGCGG TTACAAGGAT CTGCTGCGCT ACGACCTTTC GTTGCGCATG
GCGCTGGAAC TCGCGTGGAA GGAAGGTGCA CAACTCTCCG ACGGACTCTC CGGCGACATC
CGCCCAGTAA ACCAGATCTA CGAGTACTGG TGCTTTTTCT GCCTTCGGGA GATCCTTCTT
TCGCTGTGCG TCGAAATTGG AGGCGGTAAC TTCCTGACCG TGAGCAAGGA CGGCCTGAAG
GTGCAACTTG CCAAGGGGGC TCGAAGCGAG TGCCGCTTCG AGTTCACAGG GGACAGCGGC
GCCAAAGTTC GCGTCTCACT CTTCTTCAAC CGCCGCTTTC GCCGCCCGAA ATCGCCGCAG
TCGGCGTGGG AGGGCAGCTA CACCGCATCT TTCGATCCCG ATTTCAGCAT CCGGCTGAGC
AAGGCCGCCG CAGACTTGCC ATCGCATTGG CTTCACTTCG ACGCTAAGTA TCGGCTCGAG
AGGCAACAGT CGGAGACCTT GTTCGAAGAA GCGCCCGACG GCGAGCAGGA TGGTGGAATA
GCTGATTACG AAGCCGAAGT GGCACGAGTG CACAAGCTTG AGGATCTCTT CAAGATGCAC
ACGTATCGGG ACGGAATCCT TGGTACACGG GGAGCCTACG TACTCTTTCC TGGTGACGGC
GTTGGAGGCA TCGTCAGCGC GCCAAAGCCC AATCTCTTTG TTCGGAACCC GGCCGCGTTT
GGCGGTACGG GGTCCCATCA AATCCCGAGT GTCGGCACCT TCGACTTGGC CCCAGGTGGT
GGCGCTGAGC AAAAGCAGGC CATCGCTTCG CTGCTGACTA GCGTACTCGA AGCAGTCGCG
GGAGCGCCTA CCTATCAAGA GGAATATGGT TACTGGACCC CGTAA
 
Protein sequence
MSCVPLHRAR FDLGAGCWID IEADLGQGRP LAEGVIQSDS LISAEEETKA EEISWGGIPP 
MCWRDAGELF EKFQLREDTD YFVDVATPSS IQEAIHLSQE NPTWPFEPRL ANAFTREPAR
RWRQEGGKTV VTGQIRLRSH AGVLNLSPVF GGEVRAEVAC RKLRYFEEFK RLLDELAEKA
TELLLSYDSP VSLNFQTTDD LAKNESALHF LMRHVMAKER LPMSIEEIVE RPHVRLVERV
EPTPIDEIQE ADPELVADGL DYSELSPSGP LARLFRGFTP TALPHRESYE SLDTPENRYA
KAFLEHCSLV SRRLEGALAS QGRRASAREA RAWGVSLDEA LQHGMWRDVG PLTQIPANSQ
TLLRKRGYKD LLRYDLSLRM ALELAWKEGA QLSDGLSGDI RPVNQIYEYW CFFCLREILL
SLCVEIGGGN FLTVSKDGLK VQLAKGARSE CRFEFTGDSG AKVRVSLFFN RRFRRPKSPQ
SAWEGSYTAS FDPDFSIRLS KAAADLPSHW LHFDAKYRLE RQQSETLFEE APDGEQDGGI
ADYEAEVARV HKLEDLFKMH TYRDGILGTR GAYVLFPGDG VGGIVSAPKP NLFVRNPAAF
GGTGSHQIPS VGTFDLAPGG GAEQKQAIAS LLTSVLEAVA GAPTYQEEYG YWTP
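
The sequences above are printed in groups of 10 residues, 60 per line, so the spacing has to be removed before they can be used in downstream tools. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation. The helper names clean_sequence and gc_fraction are hypothetical and not part of any database API, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence from this record (six groups of 10 bases).
first_line = "ATGAGCTGCG TTCCGCTGCA TCGCGCGAGA TTCGACCTAG GCGCGGGCTG CTGGATCGAC"
dna = clean_sequence(first_line)

print(len(dna))          # prints: 60
print(gc_fraction(dna))  # GC fraction of this first line only
```

Passing the full gene sequence block through clean_sequence and then gc_fraction gives a value that can be compared against the GC content field listed in the record.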