Gene Tmz1t_2972 details


Gene Information

Locus tag: Tmz1t_2972
Symbol:
ID: 7874362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3218010
End bp: 3219737
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 66%
IMG OID: 643699893
Product: protein of unknown function DUF1302
Protein accession: YP_002889948
Protein GI: 237653634
COG category:
COG ID:
TIGRFAM ID:
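
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(3218010, 3219737)
print(length)                           # 1728
print(expected_protein_length(length))  # 575
```

Both values agree with the Gene Length and Protein Length fields of the record.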


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.715393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACAC ATCTGAAGAA ACTGGTGCTG GGGCTGGCCA TGGCCGGCAT CTGTGCGCCG 
GCGGCGTACG CCTTCCAGTT CGAGGCCGGC TCGGTGCGCG GTTCGCTCGA TTCCACGCTG
ACCGCGGGCT TCGGCAAGCG CGTCGAGGGC CGCGAGTGCA GCCACCACGG CGATCAATCC
ACCCCCTGCG GCGCCGATGC CAACACCGCG GTATGGGGCA ACGGCGACGA CGGCAATCTC
AACTACGACA AGGGCGACCT GTTCACCACC CACCTCAAGG GCTCGCACGA GCTGCTGCTG
AGCTTTCCCG ACGAGTGGAA GTTCATGGGC CGGGTGGGCT GGCTGTACGA CTTCGTCGCC
GACGACACCG CGCGCACCCC GCTCGACGGC GACGCCAAGC AGGCGGTCGG CCAGTACGTG
CGCCTCTACG ACCTGTGGGT GAGCAAGGAA TTCGACCTCG GCGAGCGCCG CGGCCGCGTG
CGCTTCGGCA ACCAGGTGGT GAGCTGGGGC GAGAGCCTGT TCATGATCGG CGGCATCAAC
TCCAACGTGG CGATGGACCT GCAGCGCCTG TCCTCGCCCG GCGTGCAGCT GAAGGAGGCC
TTCCTGCCCT CGCCGGCGCT GTCGGTCGCC GCCAACCTCG GCAGCGGAGT GAACGTCGAG
GCCTACTACC AGTTCCAGTG GAAGCCGTAC AGCTTCCCGC CGGCGGGGAC GTATTTCTCG
CAGAGCGATT TCTTCGACAA GGGGCGTGAC AGCCTGATCT ATTTCACGCC GGATGCCGCC
AGCCGGATTC GTACGCCGGG TGATCCGCTT TACGGCAAGA CCCTCGCCGC GGCTCGGGAA
GAAATGCTGA ACAGTGAACT CACGGCGATC CCGGTCGACG CCGACGACGA GCCGGGCGAC
TCCGGCCAGT TCGGCGTCTC GCTGAAGTAC CGGCCCGAGG GCATGGACGT CGATCTCGGC
CTCTACTACC AGCGCTTCCA CGACAAGACC CCCAACCTTC AATACTGGAA CGCGAACGGT
GCGGCTCGCA TGTACTTTCT GGAGGACCGC GAGCTCTACG GCATCAGCGC CAACACCTCG
GTGGGCAACT GGGCGGTGGG CGCCGAGTTG TCGTATCGGC CGGAGGATGC GGTTTCGCTC
GGGGCCTGCC AGGACATCGG CTTCGGGTCA CCCGATCAGT GCGACGGCTC GATCGACGAG
GAGCGCTACC AGTTCCACCT CACCGGCATC CTCAGCCTGA CGCCCGGCGA CCACGGCTGG
TTCCTCGACC TGATGGGCGC GCAGACCGGC ACCTTCCTTG GCGAGGCGGT GGCGATCGCC
TATCCGGGGG TGAGCAAGGA CAAGGCCTAC GTGCGCACCC GCAACGGCGT GACCTACCAG
CAACTCCCCG CCGCCGGCGC ATGGACGCAC GACAGCGGCG TCGGCGACAA GCTGTCCTGG
GGCTACATGT TCGACTTCAG CCTGACCTAC GACAGCACCC TGATCCCGGG CTGGCAGGTG
ATCCCGGGGG TGTTCTTCTC GCATGCGGTC AATGGCAACA CGCCCAACTT CATGGCCAAC
TGGATGGAGG GGGCGAAGTC GGCCAACTTC TATGTGCTGT TCAACCGCAA CCCGATGACC
TGGCAGGCCG GCATCAACTA CACCCGGTTC TGGGGCGGCG ACTACAGCTT CAGCGCGCCC
TACAAGGATC GCGACTTCGT CGGCGGCTTC ATCTCGCGCA ACTTCTGA
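
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The following is a minimal sketch, not the database's own method, of how the blocks could be cleaned up and a GC percentage computed for comparison with the GC content field. The helper names are hypothetical, and only the first row of the sequence is used in the demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGACGACAC ATCTGAAGAA ACTGGTGCTG GGGCTGGCCA TGGCCGGCAT CTGTGCGCCG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` should give a string of 1728 bases if the record is internally consistent.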
 
Protein sequence
MTTHLKKLVL GLAMAGICAP AAYAFQFEAG SVRGSLDSTL TAGFGKRVEG RECSHHGDQS 
TPCGADANTA VWGNGDDGNL NYDKGDLFTT HLKGSHELLL SFPDEWKFMG RVGWLYDFVA
DDTARTPLDG DAKQAVGQYV RLYDLWVSKE FDLGERRGRV RFGNQVVSWG ESLFMIGGIN
SNVAMDLQRL SSPGVQLKEA FLPSPALSVA ANLGSGVNVE AYYQFQWKPY SFPPAGTYFS
QSDFFDKGRD SLIYFTPDAA SRIRTPGDPL YGKTLAAARE EMLNSELTAI PVDADDEPGD
SGQFGVSLKY RPEGMDVDLG LYYQRFHDKT PNLQYWNANG AARMYFLEDR ELYGISANTS
VGNWAVGAEL SYRPEDAVSL GACQDIGFGS PDQCDGSIDE ERYQFHLTGI LSLTPGDHGW
FLDLMGAQTG TFLGEAVAIA YPGVSKDKAY VRTRNGVTYQ QLPAAGAWTH DSGVGDKLSW
GYMFDFSLTY DSTLIPGWQV IPGVFFSHAV NGNTPNFMAN WMEGAKSANF YVLFNRNPMT
WQAGINYTRF WGGDYSFSAP YKDRDFVGGF ISRNF
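
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino-acid assignments, which NCBI translation table 11 shares with the standard code (the two differ in the permitted start codons, which this sketch does not model). The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGACGACAC ATCTGAAGAA ACTGGTGCTG"))  # MTTHLKKLVL
```

The output matches the first block of the protein sequence listed above.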