Gene Tmz1t_3801 details

Gene Information

Locus tag: Tmz1t_3801
Symbol:
ID: 7874043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4193904
End bp: 4194950
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 65%
IMG OID: 643700743
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_002890767
Protein GI: 237654453
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
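
The coordinate and length fields in this record are consistent with each other, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4193904
END_BP = 4194950


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is an uncounted stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                 # 1047 bp
print(protein_length(length_bp), "aa")  # 348 aa
```

Both results match the Gene Length and Protein Length fields listed in the record.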


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGCTT CAAGGTCACT ACTAATCACC GGCGGCACCG GTTCCTTCGG CAACGCCGTC 
CTCAAGCGCT TCCTCGACAC CGACATCGGC GAGATCCGCA TCTTCAGCCG TGACGAGAAG
AAGCAGGACG ACATGCGCAA GCGCTACAAC AGCGCCAAGC TCAAGTTCTA CATCGGCGAC
GTGCGAGACC AGCGCAGCGT GGAGCAGGCG ATGCGCGGGG TGGACTTCGT CTTCCACGCC
GCGGCGCTCA AGCAAGTGCC GTCGTGCGAG TTCCACCCAA TGCAGGCGGT GCGCACCAAC
GTGCTGGGCA CCGAGAACGT GCTCGAGGCG GCGATCGCCG CCGGGGTCAA GCGCGTGGTG
GTGCTGAGCA CCGACAAGGC GGTGTACCCG ATCAACGCCA TGGGCATTTC CAAGGCGATG
ATGGAGAAGG TGATGGTGGC CACCAGCCGC AACCTGGAAG GCACCGGCAC GGTGATCTGC
GGCACGCGCT ACGGCAACGT GATGGCCTCG CGCGGGTCGG TGATTCCGCT GTTCGTCGAG
CAGGTGCTGG CGGGCAAGCC GATCACCATC ACCGACCCGA GCATGACGCG CTTCATGATG
ACGCTGGCCG ATGCAGTGGA TCTGGTGCTG TATGCCTTCA CCAACGGCAA CAACGGCGAC
ATCTTCGTGC AGAAGGCGCC GGCGGCGACC ATCGAGACGC TGGCGCGCGC GGTGACCGGG
CTGATGGGCC AGCCCGCACA CCCGGTCAAC ATCATCGGCA CCCGCCATGG CGAGAAGCTC
TACGAGGCGC TGCTGAGCCG CGAGGAGCGT GCCTGCGCCG AGGACATGGG CGACTACTTC
CGCGTGCCGG CCGATGGGCG CGACCTCAAC TACGGCAAGT TCGTGGACCA GGGCGAGGCG
AAGCTGACGC AGACCACGCA CGGTGAGGAC TACAACTCGC ACAACACCAC GCGGCTGGAC
GTGGACGGCA TGACGCAGCT GCTGCTGAAG CTCGAAGGCA TGCAGCGCAT AGCCCGCGGC
GAGACGACCA CCGTCGAGGA GTGCTGA
 
Protein sequence
MFASRSLLIT GGTGSFGNAV LKRFLDTDIG EIRIFSRDEK KQDDMRKRYN SAKLKFYIGD 
VRDQRSVEQA MRGVDFVFHA AALKQVPSCE FHPMQAVRTN VLGTENVLEA AIAAGVKRVV
VLSTDKAVYP INAMGISKAM MEKVMVATSR NLEGTGTVIC GTRYGNVMAS RGSVIPLFVE
QVLAGKPITI TDPSMTRFMM TLADAVDLVL YAFTNGNNGD IFVQKAPAAT IETLARAVTG
LMGQPAHPVN IIGTRHGEKL YEALLSREER ACAEDMGDYF RVPADGRDLN YGKFVDQGEA
KLTQTTHGED YNSHNTTRLD VDGMTQLLLK LEGMQRIARG ETTTVEEC
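
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code; table 11 differs in its set of permitted start codons, and that does not matter here because this gene starts with ATG. The `translate` helper is hypothetical and written only for this illustration. It strips the spacing used in the 10-base blocks above and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence (six complete codons).
print(translate("ATGTTCGCTT CAAGGTCACT"))          # MFASRS
# Last 27 bases of the gene sequence, which are in frame and end in TGA.
print(translate("GAGACGACCA CCGTCGAGGA GTGCTGA"))  # ETTTVEEC
```

The two outputs agree with the beginning (MFASRS...) and the end (...ETTTVEEC) of the protein sequence shown above.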