Gene Tmz1t_1121 details


Gene Information

Locus tag: Tmz1t_1121
Symbol:
ID: 7084650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1228344
End bp: 1229687
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 53%
IMG OID: 643698136
Product: nucleotide sugar dehydrogenase
Protein accession: YP_002354776
Protein GI: 217969542
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
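
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1228344
END_BP = 1229687
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one, assuming one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1344
    print(expected_protein_length(GENE_LENGTH_BP))  # 447
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields.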


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.396392
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCTTT CGATTTTTGG GACTGGTTAT GTCGGCCTTG TTACTGGCGC ATGTTTGGCT 
GAGGTCGGTC ATAAGGTGGT ATGTTTGGAT ATCGACGCGG CCAAGATTGA TCGTCTTAAT
CGTGGTGAGC TGCCAATCTG GGAGCCAGGC CTGGAGGCGA TCGTTTCTCG AAACGTTGCT
GAAGGCCGCT TGGAGTTCAC CACTGACATC GCTCGCGGTG TAGCGCATGC GGATATCCAG
TTTATTGCTG TGGGTACTCC ACCTGATGAA GATGGGTCGG CGGACTTACA GTATGTACTT
TCGGTGGCGG AGAGTATTGC CCGTGAGATG AATACGTTCA AAGTGGTCGT CAATAAATCG
ACTGTGCCAG TAGGAACGGC TGACAAGGTA CGGCAGAGAA TTGAAGGGAT TCTCCGAGAG
AGAGGTGTGG CGCTGACGTT CGACGTTGTG TCTAATCCCG AGTTTCTAAA GGAAGGGGCC
GCAGTTGGGG ATTTTTTGAA GCCGGATCGT ATAATTATTG GGGCTGGCTC GGAAAATGCT
CGTAAAGTCA TGCGGGAACT CTATGAGCCT TTTAATAGGA GTCATGAGCG AACCATGTTT
ATGGATGTTC GCAGCGCGGA GCTAACGAAG TATGCGGCGA ATGCTATGCT TGCTACGAAG
ATTAGTTTCA TGAACGAGCT GGCAAACCTG TCGGAACGAC TTGGCGCTGA TATCGAGGAG
GTCCGTAAGG GCATTGGCGC CGATCCACGC ATAGGCTATC ATTTTATTTA TCCAGGTTGT
GGGTATGGGG GTAGCTGTTT TCCGAAGGAC GTACAGGCTC TTGCTCGAAT TGCGGATGAC
GTTGGGTATG AGGCGGAACT GGTTAAGGCT GTGGAGGCGG TTAATAATCG TCAGAAGAAC
GTTCTTTTCG ACAAACTGGC AAGCCGGTTC GGTGGTGCGC GGGCACTTGG CGGAAAGGTG
ATTGCGGTTT GGGGGCTTTC GTTTAAGCCG AATACGGACG ATATGCGGGA AGCTCCGAGC
AGGACGCTTT TAGAATCGCT TTGGGCCGTA GGTGCCGAGG TGAGAGCGTT TGATCCGGTT
GCGATGGAGG AGGCTCGTCG GCTCTATAGA AGTCGCGAAG CTTTTTTTCT CGCGAGCGAT
AAATATAGTT GCCTTGATGG GGTGGACGCA CTGTGTATTT GCACCGAGTG GCAGGCGTTT
CGGGCCCCCG ATTTCGACGA GATGCAGTCT CGTATGCGGG CGCGCGTGAT TATAGATGGT
CGGAATCTTT ATCACCCTGA GCGGTTGCGC GAGATGGGCT GGGTTTACGA TAGTGTTGGG
CGGCCGTCGA GTTTGGTTGC TTAG
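
The gene sequence is displayed in blocks of ten bases, so spaces and line breaks have to be stripped before the length or GC content can be recomputed. The following is a minimal sketch; `clean_sequence` and `gc_content` are hypothetical helper names, not part of the source database, and the demonstration uses only the first ten-base block shown above.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a whitespace-free DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First ten-base block of the gene sequence, used here as a small demo.
    first_block = clean_sequence("ATGAGGCTTT ")
    print(len(first_block))         # 10
    print(gc_content(first_block))  # 40.0
```

Pasting the complete gene sequence into `clean_sequence` and passing the result to `gc_content` gives a value that can be compared with the GC content field of the record.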
 
Protein sequence
MRLSIFGTGY VGLVTGACLA EVGHKVVCLD IDAAKIDRLN RGELPIWEPG LEAIVSRNVA 
EGRLEFTTDI ARGVAHADIQ FIAVGTPPDE DGSADLQYVL SVAESIAREM NTFKVVVNKS
TVPVGTADKV RQRIEGILRE RGVALTFDVV SNPEFLKEGA AVGDFLKPDR IIIGAGSENA
RKVMRELYEP FNRSHERTMF MDVRSAELTK YAANAMLATK ISFMNELANL SERLGADIEE
VRKGIGADPR IGYHFIYPGC GYGGSCFPKD VQALARIADD VGYEAELVKA VEAVNNRQKN
VLFDKLASRF GGARALGGKV IAVWGLSFKP NTDDMREAPS RTLLESLWAV GAEVRAFDPV
AMEEARRLYR SREAFFLASD KYSCLDGVDA LCICTEWQAF RAPDFDEMQS RMRARVIIDG
RNLYHPERLR EMGWVYDSVG RPSSLVA
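
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact NCBI layout; the amino-acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which start codons are permitted, which does not matter here because this gene starts with ATG). The `translate` function is an illustrative helper, and only the first and last blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three and last three blocks of the gene sequence in this record.
    print(translate("ATGAGGCTTT CGATTTTTGG GACTGGTTAT"))  # MRLSIFGTGY
    print(translate("CGGCCGTCGA GTTTGGTTGC TTAG"))        # RPSSLVA
```

Both outputs match the corresponding start and end of the protein sequence shown above.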